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Release 3.1A John F. Collins, Biocomputing Research Unit. 
Copyright (c) 1993*1998 University of Edinburgh, O.K. 
Distribution rights by Oxford Molecular Ltd 

MPsrch_nn n.a. - n.a. database search, using Smith -Waterman algorithm 

Run on: Fri Sep 11 06:57:38 1998; MasPar time 61.54 Seconds 

1182.505 Million cell updates/sec 

Tabular output not generated. 

Title: MJS-08-794-795-5 

Description: (1-1560) from US08794795 .seq 

Perfect Score: 1560 

N.A. Sequence: 1 ATGAGAAATAAGAAAATTCT CAGGCGTGGAGTGCAGCGTC 1560 

Comp : TACTCTTTATTCTTTTAAGA GTCCGCACCTCACGTCGCAG 

Scoring table: table default 
Gap 6 

Nmatch STD : Dbase 0; Query 0 

Searched: 88822 seqs, 23323279 bases x 2 



Post-processing: Minimum Match 0% 

Listing first 45 summaries 



Database: 



Statistics: 



n -issued 

1:5_C0MB 2:PCT9_COMB 3:backfilesl 
Mean 8.943; Variance 5.183; scale 1.726 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



NO. 


Score 


Match Length DB 


ID 


Description 


Pred. No. 


1 


666 


42.7 


1868 


1 


US-08-392- 


Sequence 1, Applicatio 


0.00e+00 


2 


133 


8.5 


7218 


1 


US-08-232- 


Sequence 14, Applicati 


1.70e-72 


3 




2.5 


215 


1 


US-08-238* 


Sequence 5, Applicatio 


1.15e-09 


4 




2.4 


1367 


2 


PCT-US96-0 


Sequence 3, Applicatio 


1.72e-08 


5 




2.4 


1367 


1 


US-08-453- 


Sequence 3, Applicatio 


1.72e-08 


6 




2.4 


1839 


2 


PCT-US96-0 


Sequence 1, Applicatio 


1.72e-08 


7 




2.4 


2028 


1 


US-Q8-453- 


Sequence 1, Applicatio 


1.72e-08 


8 




2.4 


2028 


2 


PCT-US96-0 


Sequence 1, Applicatio 


1.72e-08 


9 




2.4 


2037 


1 


US-08-154- 


Sequence 1, Applicatio 


1.72e-08 


10 




2.3 


1588 


3 


5510466-3 


Patent No. 5510466. 


6.57e-08 


11 




2.2 


1196 


1 


US-08-167- 


Sequence 5, Applicatio 


2.48e-07 


12 




2.2 


1807 


3 


5510466-1 


Patent No. 5510466. 


2.48e-07 


13 




2.1 


186 


1 


US-07-972- 


Sequence 80, Applicati 


3.41e-06 


14 




2.1 


215 


1 


US-08-238- 


Sequence 5, Applicatio 


3.41e-06 


15 




2.1 


1416 


1 


US-07-621- 


Sequence 1, Applicatio 


1.24e-05 


16 




2.1 


7218 


1 


US-08-232- 


Sequence 14, Applicati 


3.41e-06 


17 




2.0 


123 


1 


US-07-609- 


Sequence 58, Applicati 


4.45e-05 


18 




2.0 


169 


1 


US-07-972- 


Sequence 78, Applicati 


4.45e-05 


19 




1.9 


2285 


1 


US-08-477- 


Sequence 9, Applicatio 


5.49e-04 


20 


28 


1.8 


216 


1 


US-G7-972- 


Sequence 46, Applicati 


1.88e-03 



21 


28 


1.8 


3394 


1 


US-08-159- 


Sequence 


4, Applicatio 


1.88e-03 


22 


28 


1.8 


4031 


1 


US-08-159- 


Sequence 


1, Applicatio 


1.68e-03 


23 


26 


1.7 


105 


1 


US-07-865- 


Sequence 


13, Applicati 


2.10e-02 


24 


26 


1.7 


108 


1 


US-07-972* 


Sequence 


30, Applicati 


2.10e-02 


25 


27 


1.7 


186 


1 


US-07-609* 


Sequence 


62, Applicati 


6.35e-03 


26 


27 


1.7 


186 


1 


US-08-175- 


Sequence 


66, Applicati 


6.35e*03 


27 


27 


1.7 


226 


1 


US-08-175- 


Sequence 


63, Applicati 


6.35e-03 


28 


27 


1.7 


4360 


1 


US-08-470- 


Sequence 


1, Applicatio 


6.35e-03 


29 


25 


1.6 


74 


2 


PCT-US95-1 


Sequence 


100, Applicat 


6.82e*02 


30 


25 


1.6 


81 


2 


PCT-US95-1 


Sequence 


98, Applicati 


6.82e-02 


31 


25 


1.6 


81 


2 


PCT-US95-1 


Sequence 


92, Applicati 


6.82e-02 


32 


25 


1.6 


82 


2 


PCT-US95-1 


Sequence 


97, Applicati 


6.82e-02 


33 


25 


1.6 


242 


1 


US-08-273- 


Sequence 


1, Applicatio 


6.82e*02 


34 


23 


1.5 


65 


1 


US-08-471- 


Sequence 


145, Applicat 


6.72e-01 


35 


24 


1.5 


66 


1 


US-08-471- 


Sequence 


144, Applicat 


2.17e-01 


36 


24 


1,5 


69 


1 


US-08-471- 


Sequence 


142, Applicat 


2.17e-01 


37 


24 


1.5 


74 


2 


PCT-US95-1 


Sequence 


94, Applicati 


2.17e-01 


38 


24 


1.5 


74 


2 


PCT-US95-1 


Sequence 


94, Applicati 


2.17e-01 


39 


24 


1.5 


75 


2 


PCT-US95-1 


Sequence 


99, Applicati 


2.17e-01 


40 


24 


1.5 


75 


2 


PCT-US95-1 


Sequence 


99, Applicati 


2.17e-01 


41 


24 


1.5 


81 


2 


PCT-US95-1 


Sequence 


92, Applicati 


2.17e-01 


42 


24 


1.5 


82 


2 


PCT-US95-1 


Sequence 


97, Applicati 


2.17e-01 


43 


24 


1.5 


162 


1 


US-07-972- 


Sequence 


34, Applicati 


2.17e-01 


44 


24 


1.5 


162 


1 


US-07-972- 


Sequence 


36, Applicati 


2.17e-01 


45 


24 


1.5 


506 


1 


US-08-253- 


Sequence 


61, Applicati 


2.17e-01 



Sequence 1, Application US/08392367B 
Sequence 1, Application US/08392367B 
Patent No. 5691197 
GENERAL INFORMATION: 
APPLICANT: Tryggvason, Karl 
APPLICANT: Elomaa, Outi 
APPLICANT: Kangas, Maarit 

TITLE OF INVENTION: An Insolated DNA Sequence For a 
Patent No. 5691197 
TITLE OF INVENTION 
TITLE OF INVENTION 
TITLE OF INVENTION 
TITLE OF INVENTION 
NUMBER OF SEQUENCES : 



No. 5691197el Macrophage Receptor with 
a Collagenous Domain and the 
Polypeptide Chain Encoded by 
such a Sequence 
2 



ALIGNMENTS 

RESULT 1 

ID US-08-392-367B-1 STANDARD; DNA; UNC; 1868 BP. 
AC xxxxxx 
DT 
DE 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



CORRESPONDENCE ADDRESS: 
ADDRESSEE: Fay, Sharpe, Beall, Fagan, 
ADDRESSEE: Minnich & McKee 
STREET: 1100 Superior Avenue 
STREET: Suite 700 
CITY: Cleveland 
STATE: Ohio 
COUNTRY: U.S.A. 
ZIP: 44114-2518 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette, 3.50 inch, 
MEDIUM TYPE: 720 Kb storable 
COMPUTER: IBM PS/2, Model 35 SX 
OPERATING SYSTEM: DOS 5.0 
SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 
APPLICATION NUMBER: US/08/392, 367B 
FILING DATE: 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Minnich, Richard J. 
REGISTRATION NUMBER: 24,175 
REFERENCE/DOCKET NUMBER: TRV 2 009 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (216) 861-5582 
TELEFAX: (216) 241-1666 
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CC TELEX: (216) 980162 

CC INFORMATION FOR SEQ ID NO: 1: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1868 base pairs 

CC TYPE: Nucleic acid 

CC STRANDEDNESS: Single 

CC TOPOLOGY: Linear 

CC MOLECULE TYPE: Nucleotide -genomic DNA 

CC HYPOTHETICAL: No. 5691197 relevant 

CC ANTI-SENSE: No. 5691197 relevant 

SQ SEQUENCE 1868 BP; 524 A; 433 C; 576 G; 335 T; 0 OTHER, 

Query Match 42.7%; Score 666; DB 1; Length 1868; 
Best Local Similarity 75. U; Pred. No. O.OOe+00; 



Matches 


1110; Conservative 0; Mismatches 354; Indels 15; Gaps 


Db 


247 


ATGGAGACCTTCGAAATCAATGATCCAGTGCCCAAGAAGAGAAATGGGGGGACCTTCTGC 


306 






nun i iiimiiini! mi minim immiimm ii imm i 




Ov 


76 


ATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCC 


135 


Db 


307 


ATGGCAGTCATGGCCATCCACCTGATCCTGCTCACGGCAGGTACTGCACTGCTGCTGATT 


366 






1 II 11 III MM llllllllllllllll II II III llllll II 1 




Ov 


136 


CTAGCTGTGGTGGTCATCTACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTC 


195 


Db 


367 


CAAGTTCTCAATCTGCAGGAGCAGCTCCAGATGCTAGAGATGTGCTGTGGCAATGGATCA 


426 






llllllll MIMIIMI II Mill 1 1 II llllll 1 Mill 1 




Ov 


196 


CAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACT 


255 


Db 


427 


CTAGCTATCGAGGACAAGCCCTTCTTCTC—GCTGCAGTGGGCAC-CCAAAACA-CAC 


480 






II M lllllll II 1 llllll llllllll Mil II 1 1 III 




Qv 


256 


CTGGCGGCTGAGGACAGCCCGTCCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACAC 


315 


Db 


481 


CTGGTACCTAGAGCACAGGGGCTGCAAGCCTTGCAGGCCCAGCTCAGCTGGGTCCATACC 


540 






MM 1 1 MM IMIIIMI 1 MINIUM Mil llllllll 1 




Qy 


316 


CTGGCTCAGGGTGCATCGAGGCTGCAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTC 


375 


Db 


541 


AGCCAGGAGCAACTCCGTCAGCAGTTCAACAACCTCACTCAAAATCCAGAGTTGTTCCAG 


600 






Mill Mill 1 1 MM 1 1 Mill lllllll II MM 1 INN 




Qy 


376 


AGCCATGAGCACTTGCTGCAGCGGGTAGACAACTTCACTCAGAACCCAGGGATGTTCAGA 


435 


Db 


601 


ATTAAAGGTGAACGAGGCTCTCCAGGTC-CAA AAGGGGCCCCGGGTGCT *CCT 


651 






ii i i i i * i i t i i lilt i i t i * i i t ill iiiiii ii ii i lit 

II MIMIIMI Nil 1 MINN IN llllll II II 1 III 




Qy 


436 


ATCAAAGGTGAACAAGGCGCCCCAGGTCTTCAAGGTCACAAGGGGGCCATGGGCATGCCT 


495 


Db 


652 


GGAATCCCCGGGCTGCCTGGGCCAGCTGCTGAGAAGGGAGAAAAGGGGGCTGCAGGTCGT 


711 






1 1 mi ii i mi ii mi 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii ii 
II III II 1 IN II III MINI II lllllll MINIMI II II 




Qy 


496 


GGTGCCCCTGGCCCGCCGGGACCACCTGCTGAGAAGGGAGCCAAGGGGGCTATGGGACGA 


555 


DD 


712 


GATGGAACCCCAGGTGTCCAAGGACCCCAGGGCCCACCAGGCAGCAAGGGAGAGGCAGGC 


771 






llllll 1 MM 1 llllllll llllllll II MINIMUM III 




Qy 


556 


GATGGAGCAACAGGCCCCTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGC 


615 


Db 


772 


CTCCAGGGACTTACGGGTGCACCAGGGAAGCAAGGAGCAACTGGTGCTCCAGGACCTCGA 


831 






Mill Mil llllll MINIMMIMMM INN 1 MINIM 1 1 




Qy 


616 


CTCCAAGGACCCCAGGGTGCTCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAA 


675 


Db 


832 


GGAGAGAAGGGCAGCAAAGGTGACATAGGTCTCACTGGCCCCAAGGGGGAACATGGCACC 


891 






IIIMINIIIIIIIIIMI II lllllll llllll II MINI IN II 




Qy 


676 


GGAGAGAAGGGCAGCAAAGGCGATGGGGGTCTCATTGGCCCAAAAGGGGAAACTGGAACT 


735 


Db 


892 


AAGGGAGACAAAGGGGACCTAGGCCTTCCAGGAAACAAAGGGGACATGGGCATGAAGGGA 


951 






llllllll Mill Mill II II lllllll IMMIIMM MINIMI III 




Qy 


736 


AAGGGAGAGAAAGGAGACCTGGGTCTCCCAGGAAGCAAAGGGGACAGGGGCATGAAAGGA 


795 


Db 


952 


GACACGGGGCCCATGGGGTCCCCTGGAGCTCAGGGAGGTAAAGGTGATGCTGGAAAACCA 


1011 






II 1 Ml lllllll 1 MM IMM MINIUM II 1 III 




Qy 


796 


GATGCAGGGGTCATGGGGCCTCCTGGAGCCCAGGGGAGTAAAGGTGACTTCGGGAGGCCA 


855 


Db ■ 


' 1012 


GGCCTACCAGGTTTGGCTGGATCTCCAGGAGTCAAAGGTGACCAAGGAAAACCTGGAGTG 


1071 






MM MIMIMMIIMI 1 III Nil INN II It 1 1 1 1 llllllll II 




Qy 


856 


GGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTG 


915 


Db 


1072 


CAGGGTGTTCCAGGCCCTCAAGGTGCACCAGGACTTTCAGGTGCCAAGGGTGAGCCAGGA 


1131 



Qy 916 CAGGGTGTTCCGGGCCCTCCTGGTGCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGC 975 

Db 1132 CGCACTGGTCTTCCTGGGCCAGCAGGACCCCCGGGAATTGCTGGGAATCCAGGGATTGCA 1191 

I Mil lllllll HI II II I I Nil MINI M 

Qy 976 AGTGCTGGCTCCCCTGGGCGAGCAGGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACA 1035 

Db 1192 GGTGTGAAAGGAAGCAAGGGTGACACAGGAATTCAAGGACAGAAAGGCACAAAAGGAGAA 1251 

II T 1 1 1 1 F 1 1 M 1 1 1 II lllllllll 1 1 ■ 1 1 1 1 1 1 Nil I IMM 

Qy 1036 GGCCTGAAAGGAAGCAAAGGGGACACAGGACTTCAAGGACAGCAAGGAAGAAAAGGAGAA 1095 

Db 1252 TCAGGAGTCCCAGGTCTTGTAGGCAGAAAGGGAGACACTGGAAGCCCTGGGCTGGCAGGT 1311 

1 1 J 1 1 1 1 1 Mill I II III llllllll II INN MINIMUM 

Qy 1096 TCAGGAGTTCCAGGCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGGGCTGGCAGGT 1155 

Db 1312 CCCAAAGGAGAACCTGGACGAGTCGGTCAGAAGGGAGACCCGGGGATGAAAGGGTCTTCT 1371 

IMM MM Mill || || NIMMMMM III lllllll llllll 
Qy 1156 CCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAGGATCTTCT 1215 

Db 1372 GGCCAGCAAGGACAAAAGGGAGAAAAGGGTCAAAAAGGCGAATCTTTCCAACGCGTCCGG 1431 

II Mill INMNMIM III III III III I MM II 

Qy 1216 GGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCCGTCAGG 1275 

Db 1432 ATCATGGGTGGCACCAACAGAGGCCGAGCTGAAGTTTACTATAACAATGAGTGGGGGACA 1491 

lllllll III MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I MINIMI 
Qy 1276 ATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGGGACA 1335 

Db 1492 ATTTGTGATGATGATTGGGATAATAATGATGCGACTGTCTTCTGTCGCATGCTCGGTTAC 1551 

Mill Mill II IN I IN MINI I lllllllll llllllll llllll 
Qy 1336 ATTTGCGATGACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTTAC 1395 

Db 1552 TCCAGAGGGAGAGCACTTAGCAGCTATGGAGGTGGCTCTGGGAACATCTGGCTGGACAAT 1611 
MM III II II II II Nil Nil Mill I IMMIIMM III 

Qy 1396 TCCAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 1455 

Db 1612 GTGAATTGTCGGGGCACAGAGAACAGTTTGTGGGACTGCAGTAAGAACTCCTGGGGCAAT 1671 

II I Ml! M MM I Mill Mill Mill lllllll II 
Qy 1456 GTTCAGTGTCGGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCAT 1515 

Db 1672 CACAATTGCGTACATAATGAAGATGCGGGTGTGGAATGC 1710 

II I III II I II II II II Mill III 
Qy 1516 CATGACTGCAGCCACGAGGAGGACGCAGGCGTGGAGTGC 1554 



RESULT 2 

ID US-08-232-463-14 STANDARD; DNA; UNC; 7218 BP. 

AC xxxxxx 

DT 

DE Sequence 14, Application US/08232463 

CC Sequence 14, Application US/08232463 

CC Patent No. 5670367 

CC GENERAL INFORMATION: 

CC APPLICANT: DORNER, F. 

CC APPLICANT: SCHEIFLINGER, F. 

CC APPLICANT ; FALKNER, F. G. 

CC TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 

CC NUMBER OF SEQUENCES: 52 

CC CORRESPONDENCE ADDRESS : 

CC ADDRESSEE: Foley G Lardner 

CC STREET: 1800 Diagonal Road, Suite 500 

CC CITY: Alexandria 

CC STATE: VA 

CC COUNTRY: USA 

CC ZIP: 22313-0299 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, version #1.25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/232,463 

CC FILING DATE: 

CC CLASSIFICATION: 435 
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cc 


PRIOR APPLICATION DATA: 


cc 


APPLICATION NUMBER: US/07/935,313 


cc 


FILING DATE: 


cc 


APPLICATION NUMBER: EP 91 114 300,6 


cc 


FILING DATE: 26-AUG-1991 


cc 


ATTORNEY/AGENT INFORMATION: 


cc 


NAME: BENT, Stephen A, 


cc 


REGISTRATION NUMBER: 29,768 


cc 


REFERENCE/DOCKET NUMBER: 30472/114 IMMU 


cc 


TELECOMMUNICATION INFORMATION: 


cc 


TELEPHONE: (703)836-9300 


cc 


TELEFAX: (703)683*4109 


cc 


TELEX: 899149 


cc 


INFORMATION FOR SEQ ID NO: 14: 


cc 


SEQUENCE CHARACTERISTICS; 


cc 


LENGTH: 7218 base pairs 


cc 


TYPE: nucleic acid 


cc 


STRANDEDNESS: Single 


cc 


TOPOLOGY: linear 


cc 


IMMEDIATE SOURCE: 


cc 


CLONE: pTZgpt-FlS 



SQ SEQUENCE 7218 BP; 1944 A; 1491 C; I486 G; 1929 T; 368 OTHER. 

Query Match 8,5%; Score 133; DB 1; Length 7218; 

Best Local Similarity 0,5%; Pred. No. 1.70e-72; 

Matches 2; Conservative 249; Mismatches 118; Indels 0; Gaps 0; 

Db 1065 ATYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY 1124 

Cp 1359 ATTTTGCCACTCGTCATCGCAAATTGTCCCCCAGGTACCACTGTAGTAAACTTCAGCCCG 1300 

Db 1125 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY 1184 

Cp 1299 GCCTCGGTTACTACTGCCGACAATCCTGACGGACACTGAGTTTTCACCTCTTTCACCTTT 1240 

Db 1185 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY 1244 

Cp 1239 TTCTCCCTTTACTCCTTGCTCCCCAGAAGATCCTTTCACTCCCTGGTCTCCCTTCTGGCC 1180 

Db 1245 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY 1304 

Cp 1179 AGCTTGTCCAGGGGCTCCCTTGGGACCTGCCAGCCCTGGGCTCCCCTGTTCTCCCTTCAC 1120 

Db 1305 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY 1364 

Cp 1119 ACCTGCAGGGCCTGGAACTCCTGATTCTCCTTTTCTTCCTTGCTGTCCTTGAAGTCCTGT 1060 

Db 1365 YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY 1424 

Cp 1059 GTCCCCTTTGCTTCCTTTCAGGCCTGTGGCTCCTGGACTCCCGGGGCTCCCTGGAAGTCC 1000 

Db 1425 YYYYYYYYY 1433 

Cp 999 TGCTCGCCC 991 



RESULT 3 

ID US-08-238-163-5 STANDARD; DNA; UNC; 215 BP. 

AC XXXXXX 

DT 

DE Sequence 5, Application US/08238163 

CC Sequence 5, Application US/08238163 

CC Patent No. 5569830 

CC GENERAL INFORMATION: 

CC APPLICANT: BENNETT, Alan 

CC APPLICANT: LABAVITCH, John M. 

CC APPLICANT: POWELL, Ann 

CC APPLICANT: STOTZ, Henrik 

CC TITLE OF INVENTION: PLANT INHIBITORS OF FUNGAL 

CC TITLE OF INVENTION: POLYGALACTURONASES AND THEIR USE TO CONTROL FUNGAL DISEASE 

CC NUMBER OF SEQUENCES: 24 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Townsend and Townsend Khourie and Crew 



CC STREET: Steuart Street Tower, One Market Plaza 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: US 

CC ZIP: 94105-1493 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release tl.0, Version #1.25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/238,163 

CC FILING DATE: 03-MAY-1994 

CC CLASSIFICATION: 800 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Bastian, Kevin L. 

CC REGISTRATION NUMBER: 34,774 

CC REFERENCE/DOCKET NUMBER: 2307E-540 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 543-9600 

CC TELEFAX: (415) 543-5043 

CC INFORMATION FOR SEQ ID NO: 5: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 215 base pairs 

CC TYPE; nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: unknown 

CC MOLECULE TYPE: protein 

CC FEATURE: 

CC NAME/KEY: misc_feature 

CC LOCATION: 1..215 

CC OTHER INFORMATION: /standard.name* "Deduced amino acid 

CC OTHER INFORMATION: sequence of PGIP from bean." 

SQ SEQUENCE 215 BP; 15 A; 8 C; 25 G; 26 T; 141 OTHER. 

Query Match 2.5%; Score 39; DB 1; Length 215; 

Best Local Similarity 13.4%; Pred. No. 1.15e-09; 

Matches 23; Conservative 74; Mismatches 74; Indels 1; Gaps 1; 

Db 6 MSSSSWSRTASCNDKAKKDGNTTSSWTTDCCNRTWGVCDTDTTYRVNNDSGHNKYSSAN 65 

Qy 1138 AGCCCAGGGCTGGCAGGTCCCAAGGGAGCCCCTGGACAAGCTGGCCAGAAGGGAGACCAG 1197 

Db 66 YNYGGNNVGAAKTHYYTHTNVSGADSKTVTDSYNASGTSSSNGGTDGNRSGADSYGSSKT 125 

I : | : :::|:| ::||:: : ::: | | : : :| : |:: |: : 

Qy 1198 GGAGTGAA-AGGATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGAAAGAGGTGA 1256 

Db 126 AMTSRNRTGKTANNAVDSRNMGDASVGSDKNTKKHAKNSADGKVGSKNNGDR 177 

I: : : :| :::: : ::|::: :::| :|:| :|:: |:: 

Qy 1257 AAACTCAGTGTCCGTCAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAA 1308 



RESULT 4 

ID PCT-US96-08081-3 STANDARD; DNA; UNC; 1367 BP, 

AC XXXXXX 

DT 

DE Sequence 3, Application PC/TUS9608081 

CC Sequence 3, Application PC/TUS9608081 

CC GENERAL INFORMATION: 

CC APPLICANT: SmithKline Beecham Corporation 

CC TITLE OF INVENTION: Attachment Enhanced 293 Cells 

CC NUMBER OF SEQUENCES: 4 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: SmithKline Beecham - Corporate Patents U.S. 

CC STREET: Mailcode - UW2220, 709 Swedeland Road 

CC CITY: King of Prussia 

CC STATE: Pennsylvania 

CC COUNTRY: U.S. A, 

CC ZIP: 19406-5090 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 
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CC SOFTWARE: Patentln Release #1,0, Version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: PCT/US96/08081 

CC FILING DATE: 

CC CLASSIFICATION: 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Jervis, Herbert H, 

CC REGISTRATION NUMBER: 31,171 

CC REFERENCE/DOCKET NUMBER: P50338 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (610) 270-5019 

CC TELEFAX: (610) 270*5090 

CC INFORMATION FOR SEQ ID NO: 3: 

CC SEQUENCE CHARACTERISTICS: 

CC length: 1367 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS : double 

CC TOPOLOGY: not relevant 

CC MOLECULE TYPE: cDNA to mRNA 

CC FEATURE: 

CC NAME/KEY: CDS 

CC LOCATION: 67.. 1143 

SQ SEQUENCE 1367 BP; 427 A; 298 C; 303 G; 339 T; 0 OTHER. 

Query Match 2.4%; Score 37; DB 2; Length 1367; 

Best Local Similarity 60.3%; Pred. No, l,72e-0B; 

Matches 108; Conservative 0; Mismatches 71; Indels 0; Gaps 0; 

Db 898 CCGGGTGAAAAAGGAGATCGAGGTCCCACTGGAGAAAGTGGTCCACGAGGATTTCCAGGT 957 

II II I MINIUM III I Mill III I II III III 
Qy 880 CCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGT 939 

Db 958 CCAATAGGTCCTCCGGGTCTTAAAGGTGATCGGGGAGCAATTGGCTTTCCTGGAAGTCGA 1017 

II I II I II III II Mill I II Mill Mill I I 
Qy 940 GCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCA 999 

Db 1018 GGACTCCCAGGATATGCCGGAAGGCCAGGAAATTCTGGACCAAAAGGCCAGAAAGGGGA 1076 

Mill Mill MM M llllll I II I Mill IMMIII 
Qy 1000 GGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGA 1058 



RESULT 5 

ID US-08-453-117-3 STANDARD; DNA; UNC; 1367 BP. 

AC xxxxxx 

DT 

DE Sequence 3, Application US/08453117 

CC Sequence 3, Application US/08453117 

CC Patent No. 5683903 

CC GENERAL INFORMATION: 

CC APPLICANT: Lysko, Paul G. 

CC APPLICANT: Elshourbagy, Nabil A. 

CC APPLICANT: Brawner, Mary E. 

CC TITLE OF INVENTION: Attachment Enhanced 293 Cells 

CC NUMBER OF SEQUENCES: 4 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: SmithKline Beecham • Corporate Patents 

CC ADDRESSEE: U.S. 

CC STREET : Mailcode * UW2220, 709 Svedeland Road 

CC CITY: King of Prussia 

CC STATE: Pennsylvania 

CC COUNTRY: U.S.A. 

CC ZIP: 19406-5090 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC -DOS/MS -DOS 

CC SOFTWARE: Patentln Release 11. 0, version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/453,117 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Jervis, Herbert H. 



CC REGISTRATION NUMBER: 31,171 

CC REFERENCE/DOCKET NUMBER: SBC-P50338 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (610) 270-5019 

CC TELEFAX: (610) 270-5090 

CC INFORMATION FOR SEQ ID NO: 3: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1367 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: double 

CC TOPOLOGY: not relevant 

CC MOLECULE TYPE: CDNA to mRNA 

CC FEATURE: 

CC NAME/KEY: CDS 

CC LOCATION: 67.. 1143 

SQ SEQUENCE 1367 BP; 427 A; 298 C; 303 G; 339 T; 0 OTHER. 

Query Match 2.4%; Score 37; DB 1; Length 1367; 

Best Local Similarity 60.3%; Pred. No, 1.72e-08; 

Matches 108; Conservative 0; Mismatches 71; Indels 0; Gaps 0 

Db 898 CCGGGTGAAAAAGGAGATCGAGGTCCCACTGGAGAAAGTGGTCCACGAGGATTTCCAGGT 957 

II II I MMIIIMI III I Mill III I II Ml Ml 
Qy 880 CCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGT 939 

Db 958 CCAATAGGTCCTCCGGGTCTTAAAGGTGATCGGGGAGCAATTGGCTTTCCTGGAAGTCGA 1017 

llllll II III II Mill I II Mill Mill I I 
Qy 940 GCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCA 999 

Db 1018 GGACTCCCAGGATATGCCGGAAGGCCAGGAAATTCTGGACCAAAAGGCCAGAAAGGGGA 1076 

Mill Mill MM II llllll I II I Mill MIMIM 
Qy 1000 GGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGA 1058 



RESULT 6 

ID PCT-US96-01427-1 STANDARD; DNA; UNC; 1839 BP. 

AC xxxxxx 

DT 

DE Sequence 1, Application PC/TUS9601427 
CC Sequence 1, Application PC/TUS9601427 
CC GENERAL INFORMATION: 

CC APPLICANT: Greene, Mark I. 

CC APPLICANT: Davis, James G. 

CC TITLE OF INVENTION: Saccular collagen and Compositions and 

CC TITLE OF INVENTION: Methods for Making and Using the Same 

CC NUMBER OF SEQUENCES: 2 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Woodcock Washburn Kurtz Mackiewicz & Norris 
CC STREET: One Liberty Place, 46th floor 
CC CITY: Philadephia 

CC STATE: PA 

CC COUNTRY: USA 

CC ZIP: 19103 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: WordPerfect 5.1 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: PCT/US96/01427 

CC FILING DATE: 

CC CLASSIFICATION: 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 08/383,744 

CC FILING DATE: 02-FEB-1995 

CC CLASSIFICATION; 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: DeLuca, Mark 

CC REGISTRATION NUMBER: 33,229 

CC REFERENCE/DOCKET NUMBER: UPN-2653 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 215-568*3100 

CC TELEFAX: 215-568-3439 
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CC INFORMATION FOR SEQ ID NO; 1: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1839 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: double 

CC TOPOLOGY: both 

CC MOLECULE TYPE: CDNA 

CC FEATURE: 

CC NAME/KEY: CDS 

CC LOCATION: 331.. 1602 

SQ SEQUENCE 1839 BP; 461 A; 444 C; 541 G; 393 T; 0 OTHER. 

Query Match 2.4%; Score 37; DB 2; Length 1839; 

Best Local Similarity 58.8%; Pred, No. l,72e-Q8; 

Matches 124; Conservative 0; Mismatches 87; Indels 0; Gaps 0; 

Db 555 GGGTCCTCTAGGGTTACCAGGGGAGAAGGGAGAGACAGGGCTCAGAGGACCTCCAGGACC 614 

III II I III III II I MM I II III lllll! I II I 
Qy 576 GGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGGTGC 635 

Db 615 AGCAGGTCTACCTGGAGCCAATGGACTCAATGGCGACATAGGTGAAAAAGGTGATCAAGG 674 

nil i mini in i n i in n n n nn 

Qy 636 TCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAAAGG 695 

Db 675 ACCGGTGGGTCTTCCTGGTGTCCCTGGGATCCCAGGAAAACCAGGAGAGAAAGGTGATCC 734 

I Mill' HI III I llll Mill II I 
Qy 696 CGATGGGGGTCTCATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGACCT 755 

Db 735 AGGCCTCAAAGGAGATAAAGGTGAACGTGGC 765 

II III llll lllll II I III 
Qy 756 GGGTCTCCCAGGAAGCAAAGGGGACAGGGGC 786 



RESULT 7 

ID US-08-453-117-1 STANDARD; DNA; UNC; 2028 BP, 

AC xxxxxx 

DT 

DE Sequence 1, Application US/08453117 

CC Sequence 1, Application US/08453117 

CC Patent No, 5683903 

CC GENERAL INFORMATION: 

CC APPLICANT: Lysko, Paul G. 

CC APPLICANT: Elshourbagy, Nabil A. 

CC APPLICANT: Brawner, Mary E. 

CC TITLE OF INVENTION: Attachment Enhanced 293 Cells 

CC NUMBER OF SEQUENCES: 4 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: SmithKline Beecham * Corporate Patents 

CC ADDRESSEE: U.S. 

CC STREET: Mailcode - UW222Q, 709 Swedeland Road 

CC CITY: King of Prussia 

CC STATE: Pennsylvania 

CC COUNTRY: U.S.A. 

CC ZIP: 19406-5090 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/453,117 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Jervis, Herbert H. 

CC REGISTRATION NUMBER: 31,171 

CC REFERENCE/DOCKET NUMBER: SBC-P50338 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (610) 270-5019 

CC TELEFAX: (610) 270-5090 

CC INFORMATION FOR SEQ ID NO: 1: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 2028 base pairs 



CC TYPE: nucleic acid 

CC STRANDEDNESS: double 

CC TOPOLOGY: not relevant 

CC MOLECULE TYPE: CDNA to mRNA 

CC FEATURE: 

CC NAME/KEY: CDS 

CC LOCATION: 47.. 1402 

SQ SEQUENCE 2028 BP; 653 A; 369 C; 434 G; 572 T; 0 OTHER. 

Query Match 2,4%; Score 37; DB 1; Length 2028; 

Best Local Similarity 60.3%; Pred. No. 1.72e-0B; 

Matches 108; Conservative 0; Mismatches 71; Indels 0; Gaps 0; 

Db 878 CCGGGTGAAAAAGGAGATCGAGGTCCCACTGGAGAAAGTGGTCCACGAGGATTTCCAGGT 937 

II II I llllllllll III I lllll III I II III III 
Qy 880 CCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGT 939 

Db 938 CCAATAGGTCCTCCGGGTCTTAAAGGTGATCGGGGAGCAATTGGCTTTCCTGGAAGTCGA 997 

II I II I II III II lllll I II lllll Mill I I 
Qy 940 GCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCA 999 

Db 998 GGACTCCCAGGATATGCCGGAAGGCCAGGAAATTCTGGACCAAAAGGCCAGAAAGGGGA 1056 

iiiii iiiii iiii ii linn i ii i inn mum 

Qy 1000 GGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGA 1058 



RESULT 8 

ID PCT-US96-0808M STANDARD; DNA; UNC; 2028 BP. 

AC xxxxxx 

DT 

DE Sequence 1, Application PC/TUS9608081 

CC Sequence 1, Application PC/TUS9608081 

CC GENERAL INFORMATION: 

CC APPLICANT: SmithKline Beecham Corporation 

CC TITLE OF INVENTION: Attachment Enhanced 293 Cells 

CC NUMBER OF SEQUENCES: 4 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: SmithKline Beecham - Corporate Patents U.S. 

CC STREET: Mailcode * UW2220, 709 Swedeland Road 

CC CITY: King of Prussia 

CC STATE: Pennsylvania 

CC COUNTRY: U.S.A. 

CC ZIP: 19406-5090 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC'DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: PCT/US96/08081 

CC FILING DATE: 

CC CLASSIFICATION: 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Jervis, Herbert H. 

CC REGISTRATION NUMBER: 31,171 

CC REFERENCE/DOCKET NUMBER: P50338 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (610) 270-5019 

CC TELEFAX: (610) 270-5090 

CC INFORMATION FOR SEQ ID NO: 1: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 2028 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: double 

CC TOPOLOGY: not relevant 

CC MOLECULE TYPE: CDNA to mRNA 

CC FEATURE: 

CC NAME/KEY: CDS 

CC LOCATION: 47.. 1402 

SQ SEQUENCE 2028 BP; 653 A; 369 C; 434 6; 572 T; 0 OTHER, 

Query Match 2.4%; Score 37; DB 2; Length 2028; 
Best Local Similarity 60.3%; Pred. No. 1.72e-08; 
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Matches 108; Conservative 0; Mismatches 71; Indels 0; Gaps 0; 
Db 878 CCGGGTGAAAAAGGAGATCGAGGTCCCACTGGAGAAAGTGGTCCACGAGGATTTCCAGGT 937 

ii ii i minim 111 i inn m i n m 

Qy 880 CCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGT 939 

Db 938 CCAATAGGTCCTCCGGGTCTTAAAGGTGATCGGGGAGCAATTGGCTTTCCTGGAAGTCGA 997 

II I II I II III II Mill I II Mill Mill I I 
Qy 940 GCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCA 999 

Db 998 GGACTCCCAGGATATGCCGGAAGGCCAGGAAATTCTGGACCAAAAGGCCAGAAAGGGGA 1056 

Mill Mill MM II llllll I II I Mill IMIIIII 
Qy 1000 GGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGA 1058 



RESULT 9 

ID US-08-154-365-1 STANDARD; DNA; UNC; 2037 BP. 

AC xxxxxx 

DT 

DE Sequence 1, Application US/08154365 

CC Sequence 1, Application US/08154365 

CC Patent No. 5624904 

CC GENERAL INFORMATION; 

CC APPLICANT: Dunne, Dana W, 

CC APPLICANT: Resnick, David 

CC APPLICANT: Kreiger, Monty 

CC APPLICANT: Joiner, Keith A. 

CC TITLE OF INVENTION: Method for Treating Gram -Positive 

CC TITLE OF INVENTION: Septicemia 

CC NUMBER OF SEQUENCES: 2 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Patrea L. Pabst 

CC STREET: 1100 Peachtree Street, Suite 2800 

CC CITY: Atlanta 

CC STATE: Ga 

CC COUNTRY: USA 

CC ZIP: 30309-4530 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version 11.25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/154,365 

CC FILING DATE: 

CC CLASSIFICATION: 514 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Pabst, Patrea L. 

CC REGISTRATION NUMBER: 31,284 

CC REFERENCE/DOCKET NUMBER; MIT6392 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE: (404 ) -815 -6508 

CC TELEFAX: ( 404 )- 815 - 6555 

CC INFORMATION FOR SEQ ID NO: 1; 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 2037 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE; cDNA 

CC HYPOTHETICAL: NO 

CC ANTI-SENSE; NO 

CC ORIGINAL SOURCE : 

CC ORGANISM: homo sapien 

CC IMMEDIATE SOURCE; 

CC LIBRARY: THP-1 

CC PUBLICATION INFORMATION; 

CC AUTHORS: Ashkenas, et al. 

CC JOURNAL; J. Lipid Res, 

CC VOLUME: 34 

CC PAGES: 983-1000 

CC DATE: 1993 

CC RELEVANT RESIDUES IN SEQ ID NO: 1: FROM 1 TO 2037 



SQ SEQUENCE 2037 BP; 656 A; 371 C; 435 G; 575 T; 0 OTHER. 

Query Match 2.4%; Score 37; DB 1; Length 2037; 

Best Local Similarity 60.31; Pred. No. 1.72e-08; 

Matches 108; Conservative 0; Mismatches 71; Indels 0; Gaps 0 

Db 878 CCGGGTGAAAAAGGAGATCGAGGTCCCACTGGAGAAAGTGGTCCACGAGGATTTCCAGGT 937 

II II I Mill 'I Ml | Mill III I M III III 
Qy 880 CCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGT 939 

Db 938 CCAATAGGTCCTCCGGGTCTTAAAGGTGATCGGGGAGCAATTGGCTTTCCTGGAAGTCGA 997 

II I II I II Ml II Mill I II IMM Mill I I 
Qy 940 GCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTGGGCGAGCA 999 

Db 998 GGACTCCCAGGATATGCCGGAAGGCCAGGAAATTCTGGACCAAAAGGCCAGAAAGGGGA 1056 

Mill Mill MM II MUM I II I Mill IMMIII 

Qy 1000 GGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAAGCAAAGGGGA 1058 



RESULT 10 

ID 5510466-3 STANDARD; DNA; UNC; 1720 BP. 

AC xxxxxx 

DT 01-JAN-1900 

DE Patent No. 5510466. 

CC Patent No. 5510466 

CC APPLICANT: KREIGER, MONTY; KODAMA, TATSUHIKO 

CC TITLE OF INVENTION: SCAVENGER RECEPTOR PROTEIN AND ANTIBODY 

CC THERETO 

CC NUMBER OF SEQUENCES: 12 

CC CURRENT APPLICATION DATA; 

CC APPLICATION NUMBER: US/08/307,400 

CC FILING DATE: 16-SEP-1994 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 997,113 

CC FILING DATE: 24-DEC-1992 

CC APPLICATION NUMBER: 391,486 

CC FILING DATE: 09-AUG-1989 

CC APPLICATION NUMBER: 272,002 

CC FILING DATE: 15-NOV-1988 

CC SEQ ID NO: 3: 

CC LENGTH: 1588 

SQ Sequence 1720 BP; 506 A; 304 C; 367 G; 411 T; 132 other; 

Query Match 2.3%; Score 36; DB 3; Length 1588; 

Best Local Similarity 59.8%; Pred. No. 6.57e-08; 
Matches 110; Conservative 0; Mismatches 74; Indels 0; Gaps 0 

Db 824 CTGGACCTCCAGGTGAAAAAGGAGATAGAGGCCCTCCTGGACAAAATGGTATACCAGGCT 883 

MM III II I IMIIIMI III I IN I III I II III 
Qy 872 CTGGTTTTCCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCC 931 

Db 884 TTCCAGGTCTAATAGGTACTCCAGGTCTTAAAGGTGATCGGGGGATCTCTGGTTTACCTG 943 

III III I I II llllll II Mill I II I MM I MM 
Qy 932 CTCCTGGTGCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTG 991 

Db 944 GAGTTCGAGGATTCCCAGGACCAATGGGGAAGACCGGGAAGCCAGGACTTAATGGACAAA 1003 

I MM I Mill MM I II MM II II Ml I 
Qy 992 GGCGAGCAGGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAAGCA 1051 

Db 1004 AAGG 1007 
MM 

Qy 1052 AAGG 1055 



RESULT 11 

ID US-08-167-939A-5 STANDARD; DNA; UNC; 1196 BP. 

AC xxxxxx 

DT 

DE Sequence 5, Application US/08167939A 

CC Sequence 5, Application US/08167939A 

CC Patent No. 5578703 
CC GENERAL INFORMATION: 

CC APPLICANT: ichijo, Hidenori; Miyazono, Kohei; 
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CC applicant: R nnstrand, Lars; Hellman, Ulf; Wernstedt, Christer; 

CC APPLICANT: Heldin, Carl-Henrik 

CC TITLE OF INVENTION: Substantially Pure Receptor-Like 

CC TITLE OF INVENTION: TGF-al Binding Molecules And Uses Thereof 

CC NUMBER OF SEQUENCES: 14 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Felfe fi Lynch 

CC STREET: 805 Third Avenue 

CC CITY: New York City 

CC STATE: New York 

CC COUNTRY: USA 

CC ZIP: 10022 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Diskette, 5,25 inch, 360 kb storage 

CC COMPUTER: IBM 

CC OPERATING SYSTEM: PC-DOS 

CC SOFTWARE: WordPerfect 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/167, 939A 

CC FILING DATE: 22-APRIL-1994 

CC CLASSIFICATION: 530 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: PCT/US92/05199 

CC FILING DATE: 18 -JUNE- 1992 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Hanson, No, 5578703man D. 

CC REGISTRATION NUMBER: 30,946 

CC REFERENCE/DOCKET NUMBER: LUD 5259.1 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (212) 688-9200 

CC TELEFAX: (212) 838-3884 

CC INFORMATION FOR SEQ ID NO: 5: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1196 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

SQ SEQUENCE 1196 BP; 257 A; 363 C; 387 G; 189 T; 0 OTHER. 

Query Match 2,2%; Score 35; DB 1; Length 1196; 

Best Local Similarity 61,0%; Pred. No. 2,48e-07; 

Matches 97; Conservative 0; Mismatches 62; Indels 0; Gaps 0; 

Db 147 AGGCTGCCCGGGGCTGCCTGGAGCCGCAGGGCCCAAGGGAGAGGCGGGCGCCAGTGGACC 206 

mi i mi i i ii i i ii iiiiiiiiiiiiiiii i inn 

Qy 567 AGGCCCCTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACC 626 

Db 207 GAAGGGAGGACAAGGCCCTCCCGGAGCCCCTGGGGAGCCAGGACCCCCCGGGCCCAAAGG 266 

mi i i iii i inin mi mm n n 

Qy 627 CCAGGGTGCTCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGG 686 

Db 267 AGACCGAGGGGAGAAGGGCGAGCCTGGACCAAAAGGAGA 305 

I III II III III MM || 

Qy 687 CAGCAAAGGCGATGGGGGTCTCATTGGCCCAAAAGGGGA 725 



RESULT 12 

ID 5510466-1 STANDARD; DNA; UNC; 1957 BP. 

AC xxxxxx 

DT 0WAN-1900 

DE Patent No. 5510466. 

CC Patent No. 5510466 

CC APPLICANT: KREIGER, MONTY; KODAMA, TATSUHIKO 

CC TITLE OF INVENTION; SCAVENGER RECEPTOR PROTEIN AND ANTIBODY 

CC THERETO 

CC NUMBER OF SEQUENCES: 12 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/307,400 

CC FILING DATE: 16-SEP-1994 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 997,113 

CC FILING DATE: 24-DEC-1992 

CC APPLICATION NUMBER: 391,486 



CC FILING DATE: 09-AUG-19B9 

CC APPLICATION NUMBER: 272,002 

CC FILING DATE: 15-NOV-1988 

CC SEQ ID NO:l: 

CC LENGTH: 1807 

SQ Sequence 1957 BP; 556 A; 384 C; 406 G; 461 T; 150 other; 

Query Match 2.2%; Score 35; DB 3; Length 1807; 

Best Local Similarity 62.6%; Pred. No. 2.48e-Q7; 

Matches 87; Conservative 0; Mismatches 52; Indels 0; Gaps 0; 

Db 856 CTGGACCTCCAGGTGAAAAAGGAGATAGAGGCCCTCCTGGACAAAATGGTATACCAGGCT 915 

llll Ill II I MM III | lllllll I III I II III 
Qy 872 CTGGTTTTCCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGGGCC 931 

Db 916 TTCCAGGTCTAATAGGTACTCCAGGTCTTAAAGGTGATCGGGGGATCTCTGGTTTACCTG 975 

III III I I II MM II lllll I II I llll I llll 
Qy 932 CTCCTGGTGCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCCCTG 991 

Db 976 GAGTTCGAGGATTCCCAGG 994 

I llll I lllll 
Qy 992 GGCGAGCAGGACTTCCAGG 1010 



RESULT 13 

ID US-07-972-032-80 STANDARD; DNA; UNC; 186 BP. 

AC xxxxxx 

DT 

DE Sequence 80, Application US/07972032 

CC Sequence 80, Application US/07972032 

CC Patent No. 5496712 

CC GENERAL INFORMATION: 

CC APPLICANT: Cappello, Joseph 

CC APPLICANT: Ferrari, Franco A. 

CC TITLE OF INVENTION: HIGH MOLECULAR WEIGHT COLLAGEN -LIKE 

CC TITLE OF INVENTION: PROTEIN POLYMERS 

CC NUMBER OF SEQUENCES: 85 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE; Bertram I. Rowland 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE; California 

CC COUNTRY: USA 

CC ZIP: CA 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version 11.25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/07/972,032 

CC FILING DATE: 19921105 

CC CLASSIFICATION; 435 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER; US/07/791,960 

CC FILING DATE: 12-NOV-1991 

CC ATTORNEY/AGENT INFORMATION; 

CC NAME: Rowland, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A*55556-1/BIR;PROP-08"1 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 781-1989 

CC TELEFAX: (415) 398-3249 

CC INFORMATION FOR SEQ ID NO; 80; 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH; 186 base pairs 

CC TYPE; NUCLEIC ACID 

CC STRANDEDNESS: double 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: CDNA 

CC FEATURE: 

CC NAME/KEY; CDS 

CC LOCATION: 1. .180 
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SQ SEQUENCE 186 BP; 27 A; 58 C; 77 G; 24 T; 0 OTHER. 

Query Match 2,1%; Score 33; DB 1; Length 186; 

Best Local Similarity 60,4%; Pred, No. 3,41e-06; 

Matches 96; Conservative 0; Mismatches 63; Indels 0; Gaps 0; 

Db 28 GGTGCACCGGGTCTGCAGGGTGCACCGGGAGCGCCAGGTAGCCAGGGTGCACCGGGATTG 87 

lllll I! It I li III MM Ell II I INI II Mil 
Qy 496 GGTGCCCCTGGCCCGCCGGGACCACCTGCTGAGAAGGGAGCCAAGGGGGCTATGGGACGA 555 

Db 88 CAGGGGGCTCCGGGTGCACCAGGTAGCCAGGGAGCACCGGGTCTGCAAGGAGCACCGGGT 147 

I II II I II I I II III II lllllll I I INI Mil 
Qy 556 GATGGAGCAACAGGCCCCTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGC 615 

Db 148 GCACCGGGATCCCAGGGCGCTCCGGGCCTGCAAGGTGCC 186 

I 111 lllllll Mill II llllll III 
Qy 616 CTCCAAGGACCCCAGGGTGCTCCAGGGAAGCAAGGAGCC 654 



RESULT 14 

ID US-08-238-163-5 STANDARD; DNA; UNC; 215 BP. 

AC xxxxxx 

DT 

DE Sequence 5, Application US/08238163 

CC Sequence 5, Application US/08238163 

CC Patent No. 5569830 

CC GENERAL INFORMATION.: 

CC APPLICANT: BENNETT, Alan 

CC APPLICANT: LABAVITCH, John M. 

CC APPLICANT: POWELL, Ann 

CC APPLICANT; STOTZ, Henrik 

CC TITLE OF INVENTION: PLANT INHIBITORS OF FUNGAL 

CC TITLE OF INVENTION: POLYGALACTURONASES AND THEIR USE TO CONTROL FUNGAL DISEASE 

CC NUMBER OF SEQUENCES: 24 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Townsend and Tovnsend Khourie and Crew 

CC STREET: Steuart Street Tower, One Market Plaza 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: US 

CC ZIP: 94105-1493 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release II. 0, Version #1.25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/238,163 

CC FILING DATE: 03-MAY-1994 

CC CLASSIFICATION: 800 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Bastian, Kevin L. 

CC REGISTRATION NUMBER: 34,774 

CC REFERENCE/DOCKET NUMBER: 2307E-540 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 543-9600 

CC TELEFAX: (415) 543-5043 

CC INFORMATION FOR SEQ ID NO: 5: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 215 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: unknown 

CC MOLECULE TYPE: protein 

CC FEATURE: 

CC NAME/KEY: miscjeature 

CC LOCATION: 1..215 

CC OTHER INFORMATION: /standard_name« "Deduced amino acid 

CC OTHER INFORMATION: sequence of PGIP from bean." 

SQ SEQUENCE 215 BP; 15 A; 8 C; 25 G; 26 T; 141 OTHER. 

Query Match 2.1%; Score 33; DB 1; Length 215; 
Best Local Similarity 16.2%; Pred. No, 3.41e-06; 



Matches 33; Conservative 78; Mismatches 90; Indels 3; Gaps 3; 

Db 7 SSSSWSRTASCNDKAKKDGNTTSSWTTDCCNRTW-GVCDTDTTYRVNNDSGHNKYSSAN 65 

;:::; :: :| :: ::: ||: : I : |:|:|:||::: : h :| 
Cp 489 GCCCATGGCCCCCTTGTGACCTTGAAGACCTGGGGCGCCTTGTTCACCTTTGATTCTGAA 430 

Db 66 YNYGGNNVGAAKTHYYTHTNVSGADSKTVTDSYNASGTSSSNGGTDGNRSGADSYGSSKT 125 1 

: ; ;| :|:: | : | ::: :: I :: ||: :::| :: 
Cp 429 CATCCCTGGG-TTCTGAGTGAAGTTGTCTACCCGCTGCAGCAAGTGCTCATGGCTGACGC 371 

Db 126 AMTSRNRTGKTANNAVDSRNMGDASVGSDKNTKKHAKNSADGKVGSKNNGDRNNRYGTGT 185 

: : |: | :;: I I:: ::: :: : :|:| : : |:: : Mil 
Cp 370 GGACCCAGGTGAGTTGGGCCTGCAGGACTTGCAGCCTCGATGCACC-CTGAGCCAGGTGT 312 

Db 186 KSNVSNNCGGGNKRDVSSYANNKC 209 

:: :: I I : :: :: :| 
Cp 311 TCTCCAGGGTGTGCTGACTGCAGC 288 



RESULT 15 

ID US-07-62H91G-1 STANDARD; DNA; UNC; 1416 BP. 

AC xxxxxx 

DT 

DE Sequence 1, Application US/07621091G 
CC Sequence 1, Application US/07621091G 
CC Patent No. 5424408 

CC GENERAL INFORMATION: 

CC APPLICANT: Reeders, Stephen T., Morrison, Karen E., Hudson, Billy 

CC APPLICANT: G. 

CC TITLE OF INVENTION: Alpha-3 Chain Type IV Collagen 

CC TITLE OF INVENTION: Polynucleotides 

CC NUMBER OF SEQUENCES: 23 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Yale University, Office of Cooperative Research 

CC STREET: 246 Church Street 

CC CITY: New Haven 

CC STATE: Connecticut 

CC COUNTRY: U.S.A. 

CC ZIP: 06510 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Diskette, 3,50 inch, 800K storage 

CC COMPUTER: Apple Macintosh 

CC OPERATING SYSTEM: Macintosh OS7.0 

CC SOFTWARE: Microsoft Word 5.1a 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/07/621, 091G 

CC FILING DATE: 11/30/90 

CC CLASSIFICATION: 424 

CC PRIOR APPLICATION DATA: No. 5424408 applicable 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Barth, Richard S. 

CC REGISTRATION NUMBER: 28180 

CC REFERENCE/DOCKET NUMBER: 900983/RB 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (212) 972-1400 

CC TELEFAX: (212) 370-1622 

CC TELEX: 236268 

CC INFORMATION FOR SEQ ID NO: 1; 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1416 base pairs 

CC TYPE: Nucleic acid 

CC STRANDEDNESS: Double 

CC TOPOLOGY: Linear 

CC MOLECULE TYPE: CDNA to mRNA 

CC HYPOTHETICAL: No 

CC ANTI-SENSE: No 

CC ORIGINAL SOURCE: 

CC ORGANISM: Calf 

CC STRAIN: Unknown 

CC INDIVIDUAL ISOLATE: Unknown 

CC DEVELOPMENTAL STAGE: Unknown 

CC CELL TYPE: Whole kidney 

CC IMMEDIATE SOURCE: 
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CC LIBRARY: Bovine lens cDNA 
CC CLONE; KMC15 

CC POSITION IN GENOME: No. 5424408 known 

CC CHROMOSOME/SEGMENT: No. 5424408 known 

CC PUBLICATION INFORMATION: No. 5424408e 

SQ SEQUENCE 1416 BP; 374 A; 365 C; 393 G; 284 T; 0 OTHER. 

Query Match 2.1%; Score 32; DB 1; Length 1416; 

Best Local Similarity 61.0%; Pred. No. 1.24e-05; 

Matches 89; Conservative 0; Mismatches 57; Indels 0; Gaps 0; 

Db 472 GGAAGCCCAGGCCCACCTGGTTCAGCTGGAGAACCAGGGATGCAAGGAGAACCCGGGCCC 531 

II II lllilllllll INI Mill II II M Ml I || I 
Qy 847 GGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCTAAAGGAGATCAAGGACAA 906 

Db 532 CCAGGACCACCAGGAGATCCAGGACCCTGTGGGCCAAAAGGTAAACCAGGGGAGGATGGT 591 

II llll I II I III II II III il II I Mill I I III 
Qy 907 CCTGGACTGCAGGGTGTTCCGGGCCCTCCTGGTGCAGTGGGACACCCAGGTGCCAAGGGT 966 

Db 592 CCACCAGGAACTCCTGGACCAACTGG 617 

II II I I MM I llll 
Qy 967 GAGCCTGGCAGTGCTGGCTCCCCTGG 992 



Search completed: Fri Sep 11 06:59:56 1998 
Job time : 138 sees. 
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Release 3.1A John F, Collins, Biocomputing Research Unit. 
Copyright (c) 1993-1998 University of Edinburgh, U.K. 
Distribution rights by Oxford Molecular Ltd 



MPsrch_nn n.a 
Run on: 



n.a. database search, using Smith-Waterman algorithm 



Fri Sep 11 07:00:14 1998; MasPar time 1122.70 Seconds 

1310.169 Million cell updates/sec 

Tabular output not generated. 



Title: 

Description: 
Perfect Score: 
N.A. Sequence: 
Comp: 

Scoring table: 



Nmatch STD 
Searched: 



MJS-08-794-795-5 

(1-1560) from US08794795 . seq 

1560 

1 ATGAGAAATAAGAAAATTCT CAGGCGTGGAGTGCAGCGTC 1560 

TACTCTTTATTCTTTT AAGA GTCCGCACCTCACGTCGCAG 

TABLE default 
Gap 6 

Dbase 0; Query 0 

1460335 seqs, 471452172 bases x 2 



Post-processing: Minimum Match 0% 

Listing first 45 summaries 

Database: n-pending 

1:P9 2:U6000 3:U6001 4:U6002 5:U6003 6:U6004 7:U6005 
8:U6006 9:U6007 10:U6008 11:U7 12:U80 13:U81 14 :U82 
15 :U83 16:U84A 17:U84B 18:U85 19:U86 20:U87 21:U88 
22:U89 23:U90A 24:U90B 25:091 26:NEWP 27 : NEWU8 28:NEWU9 

Statistics: Mean 10.635; Variance 4.217; scale 2,522 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



NO. 


Score 


Match Length 


DB 


ID 


Description 


Pred. No, 


1 


1560 


100.0 


1560 


20 


US-08-794- 


Sequence 5, Applicatio 


0.00e+00 


2 


1558 


99.9 


1835 


22 


US-08-934- 


Sequence 2, Applicatio 


0.00e+00 


3 


1523 


97.6 


1703 


20 


US-08-794- 


Sequence 1, Applicatio 


0.00e+00 


4 


1023 


65.6 


1323 


23 


US-09-023- 


Sequence 813, Applicat 


0.00e+00 


5 


742 


47.6 


775 


21 


US-08-837- 


Sequence 1335, Applica 


0.00e+00 


6 


666 


42.7 


1868 


21 


US-08-893- 


Sequence 1, Applicatio 


0.00e+00 


7 


309 


19.8 


311 


24 


US-09-057- 


Sequence 358, Applicat 


2,32e-284 


8 


269 


17.2 


269 


21 


US-08-801- 


Sequence 148, Applicat 


4.20e-242 


9 


252 


16.2 


252 


5 


US-60-039- 


Sequence 1236, Applica 


3.24e-224 


10 


250 


16.0 


250 


21 


US-08-808- 


Sequence 5073, Applica 


4.10e-222 


11 


247 


15.8 


251 


20 


US-08-770- 


Sequence 3424, Applica 


5.79e-219 


12 


242 


15.5 


246 


23 


US-09-016- 


Sequence 839, Applicat 


1.02e-213 


13 


241 


15.4 


241 


7 


US-60-051- 


Sequence 163, Applicat 


1.15e-212 


14 


239 


15.3 


242 


21 


US-08-810- 


Sequence 2282, Applica 


1.43e-210 


15 


228 


14.6 


228 


20 


US-08-770* 


Sequence 3302, Applica 


4.75e-199 


16 


218 


14.0 


218 


20 


US-08-770- 


Sequence 164, Applicat 


1.34e-188 


17 


218 


14.0 


218 


2 


US-60-009- 


Sequence 164, Applicat 


1.34e-188 



18 


216 


13.8 


218 4 


US-60-023* 


Sequence 


1670, Applica 


1.64e 


186 


19 


215 


13.8 


220 16 


US-08 


■413- 


Sequence 


265, Applicat 


l.Sle 


185 


20 


206 


13.2 


207 4 


US-60-020- 


Sequence 


1145, Applica 


4.32e- 


176 


21 


199 


12.8 


211 20 


US-08 


■770' 


Sequence 


93, Applicati 


8.23e 


169 


22 


199 


12.8 


211 2 


US-60 


■009- 


Sequence 


93, Applicati 


8.23e 


169 


23 


197 


12.5 


219 20 


US-08 


■770- 


Sequence 


4015, Applica 


9.84e 


167 


24 


192 


12.3 


294 20 


US-08 


■770- 


Sequence 


4110, Applica 


1.52e 


161 


25 


190 


12.2 


203 20 


US-08 


■770- 


Sequence 


4448, Applica 


1.80e 


159 


26 


188 


12.1 


203 16 


US-08 


-412- 


Sequence 


1358, Applica 


2.12e 


157 


27 


186 


11.9 


239 21 


US-08 


■803- 


Sequence 


5482, Applica 


2.50e 


155 


28 


185 


11.9 


243 23 


US-09-016- 


Sequence 


889, Applicat 


2.71e 


154 


29 


183 


11.7 


220 21 


US-08 


■801- 


Sequence 


1564, Applica 


3.17e 


152 


30 


179 


11.5 


226 4 


US-60-023- 


Sequence 


1927, Applica 


4.32e 


148 


31 


171 


11.0 


207 22 


US-08 


■951- 


Sequence 


793, Applicat 


7.68e 


140 


32 


165 


10.6 


222 3 


US-60-010- 


Sequence 


515, Applicat 


U4e 


133 


33 


165 


10.6 


222 21 


US-08 


-801- 


Sequence 


2393, Applica 


1.14e 


133 


34 


159 


10.2 


590 21 


US-08 


-837- 


Sequence 


915, Applicat 


1.64e 


127 


35 


146 


9.4 


218 4 


US-60 


■025- 


Sequence 


260, Applicat 


3.15e 


114 


36 


146 


9.4 


218 22 


US-08 


-903- 


Sequence 


2238, Applica 


3.15e 


114 


37 


147 


9.4 


227 16 


US-08 


-420- 


Sequence 


523, Applicat 


3.02e 


115 


38 


136 


8.7 


186 4 


US-60 


-023- 


Sequence 


396, Applicat 


4,55e 


104 


C 39 


133 


8.5 


7218 17 


US-08 


-466- 


Sequence 


14, Applicati 


4.94e 


•101 


40 


129 


8.3 


165 3 


US-60 


-013- 


Sequence 


158, Applicat 


5.42e 


■97 


41 


129 


8.3 


165 21 


US-08 


-808- 


Sequence 


158, Applicat 


5.42e 


97 


42 


112 


7.2 


265 20 


US-08 


-770- 


Sequence 


5209, Applica 


5.94e 


80 


43 


99 


6.3 


578 18 


US-08 


-569- 


Sequence 


14, Applicati 


4.30e 


67 


44 


99 


6.3 


578 19 


US-08 


-624- 


Sequence 


14, Applicati 


4.30e 


67 


45 


96 


6.2 


248 21 


US-08 


-810- 


Sequence 


382, Applicat 


3.77e 


64 



RESULT 
ID 
AC 
DT 
DE 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



US-08-794-795-5 STANDARD; 
xxxxxx 



ALIGNMENTS 



DNA; UNC; 1560 BP. 



Sequence 5, Application US/08794795 
Sequence 5, Application US/08794795 
GENERAL INFORMATION: 
APPLICANT: Elshourlagy, Nabil 
APPLICANT: Adamou, John 
APPLICANT: Gross, Mitchell 
APPLICANT: Lysko, Paul 

TITLE OF INVENTION: Human Macro Scavenger Rec 
TITLE OF INVENTION: eptor 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: SmithKline Beecham Corporation 
STREET: 709 Svedeland Road 
CITY: King of Prussia 
STATE: PA 
COUNTRY: USA 
ZIP: 19406 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Diskette 
COMPUTER: IBM Compatible 
OPERATING SYSTEM: DOS 
SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA: 
APPLICATION NUMBER: US/08/794,795 
FILING DATE: Q4-FEB-1997 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: ATG50009P 
FILING DATE: 22-MAY-1996 
ATTORNEY/AGENT INFORMATION: 
NAME: Han, William T 
REGISTRATION NUMBER: 34,344 
REFERENCE/DOCKET NUMBER: ATG50009 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 610-270-5219 
TELEFAX: 610-270-4026 
TELEX: 
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CC INFORMATION FOR SEQ ID NO: 5: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1560 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY : linear 

CC MOLECULE TYPE: CDNA 

SQ SEQUENCE 1560 BP; 389 A; 387 C; 533 6; 251 T; 0 OTHER. 



Query Match 100.0%; 
Best Local Similarity 100.0%; 
1560; Conservative 



Matches 


Db 


l 


Qy 


l 


Db 


61 


Qy 


61 


Db 


121 


Qy 


121 


Db 


181 


Qy 


181 


Db 


241 


Qy 


241 


Db 


301 


Qy 


301 


Db 


361 


Qy 


361 


Db 


421 


Qy 


421 


Db 


481 


Qy 


481 


Db 


541 


Qy 


541 


Db 


601 


Qy 


601 


Db 


661 


Qy 


661 


Db 


721 


Qy 


721 


Db 


781 


Qy 


781 


Db 


841 


Qy 


841 



Score 1560; DB 20; 
Pred. No. Q.00e+00; 
0; Mismatches 0 



Length 1560; 
Indels 0; Gaps 0; 



IIIIIIIIMIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
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Db 


901 


Qy 


901 


Db 


961 


Qy 


961 


Db 


1021 


Qy 


1021 


Db 


1081 


Qy 


1081 


Db 


1141 


Qy 


1141 


Db 


1201 


Qy 


1201 


Db 


1261 


Qy 


1261 


Db 


1321 


Qy 


1321 


Db 


1381 


Qy 


1381 


Db 


1441 


Qy 


1441 


Db 


1501 


Qy 


1501 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



IIIIIHIIIIIIIIIIIMIIIIIIIllllllllllMMIIIIIIIIIIIIIIilill 



IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIMIIIIIIIMIIIIIIIMIMI 



MUM I M M IMMMMM' II II II III IIMIMM M 
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Sequence 2, Application US/08934168 
Sequence 2, Application US/08934168 
GENERAL INFORMATION: 
APPLICANT; Bandit Olga 
APPLICANT: Lai, Preeti 
APPLICANT; Corley, Neil C. 
APPLICANT; Shah, Purvi 
TITLE OF INVENTION: HUMAN MARCO 
NUMBER OF SEQUENCES; 3 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Incyte Pharmaceuticals, Inc. 

STREET: 3174 Porter Drive 

CITY: Palo AltO 

STATE: CA 

COUNTRY: USA 

ZIP: 94304 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for windows Version 2.0 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/934,168 

FILING DATE: Herewith 
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CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 

CC FILING DATE: 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Billings, Lucy J. 

CC REGISTRATION NUMBER: 36,749 

CC REFERENCE/DOCKET NUMBER: PF-0392 US 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 650-855-0555 

CC TELEFAX: 650-845-4166 

CC TELEX: 

CC INFORMATION FOR SEQ ID NO: 2: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1835 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS; single 

CC TOPOLOGY: linear 

CC IMMEDIATE SOURCE: 

CC LIBRARY: MMLR3DT01 

CC CLONE: 569648 

SQ SEQUENCE 1835 BP; 454 A; 451 C; 610 6; 320 T; 0 OTHER. 



Query Match 99.9%; 
Best Local Similarity 99.9%; 
1559; Conservative 



Matches 


Db 


119 


Qy 


1 


Db 


179 


Qy 


61 


Db 


239 


Qy 


121 


Db 


299 


Qy 


181 


Db 


359 


Qy 


241 


Db 


419 


Qy 


301 


Db 


479 


Qy 


361 


Db 


539 


Qy 


421 


Db 


599 


Qy 


481 


Db 


659 


Qy 


541 


Db 


719 


Qy 


601 


Db 


779 



Score 1556; DB 22; 
Pred. no. 0.00e+00; 
0; Mismatches 1; 



Length 1835; 
Indels 0; Gaps 0; 



lllllllllimillMlillllllillllllllimillllllMIIIIIIIIIH 



IIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII! 



Illllllllllllllllllllllllllllllllllimillllllllllllllllllll 



;inr :iiMiiHr ii ,iifiiiiii: iiiMini: NiMH in 



IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



iiiiiiiiiiiiiiiiimiiimimiiiiiiiiiiiiiiiiiiMiiiiiiiii! 



iiimiiiiiiiiiiimiiiiiiimiiiimimiiiiimiiiiiiiiiii 



iiiiiiiiimiiiiiiiiiimiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiii 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiini 



iiiiMiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiimimiiii 



iiiiiiiiiiiiimiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiiiiiiiii 



Qy 661 ACCCCAGGACCCCMGGAGAGMGGGCAGCAAAGGCGATGGGGGTCTCATTGGCCCAAAA 720 



IIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



III IINI MINI III llll I MUiUI UJI'I i 



lllll Ml llllllllllllllllllllllllllllllllllllllllllllll 



llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 



ihlll MIIIIMI lllllll, Mllllllll H'lllll: Mlllllli 



mm iimiiii iiinii I'Minii 'mini iiniiiii 



IMMMMM MMMI lil'IMI IMMI MMII 



MIM MINIM MMM IMIMI I'MM IMMM 



I'MIMMMM: IIIM'I M'I'M II'MI IIMM 



MM IMIMI IMIM MMM! M IM MIM 



IMMM MIM IMIM MMM IIMM MMM 



MM II IMMM MMM Ml MM IIMM IMMM 



MMMMIMI IMMIMMM MMMMMMMM MMMM 



Mill! IIMMIIII IMMM MMMMMMMM MMM 



yy 


QUI 


nh 

UD 




Civ 

Qy 


V)] 
1 Li 


nK 
UD 


oy y 


Air 

Qy 


7fl1 
/Ol 


nK 
DD 


quo 


vy 


ail 


nK 


1 m q 


Ov 


901 


nh 




wy 


Qfil 




1 1 1Q 


Qy 


1 noi 


UD 


1 1 AO 


Qy 


1 npi 


UD 


1 IRQ 


uy 




nK 
UD 


1 71Q 


uy 


LlKJX 


DD 


1 no 


Qy 


1 osi 


nK 
DD 


1 nid 


uy 


i 

1 jZI 


nK 




oy 


1381 


Db 


1559 


oy 


1441 


Db 


1619 


oy 


1501 



RESULT 
ID 
AC 



US-08-794-795-1 STANDARD; DNA; UNC; 1703 BP, 
xxxxxx 



DT 

DE Sequence 1, Application US/08794795 

CC Sequence 1, Application US/08794795 

CC GENERAL INFORMATION: 

CC APPLICANT: Elshourlagy, Nabil 

CC APPLICANT: Adamou, John 

CC APPLICANT: Gross, Mitchell 

CC APPLICANT: Lysko, Paul 

CC TITLE OF INVENTION: Human Macro Scavenger Rec 

CC TITLE OF INVENTION: eptor 

CC NUMBER OF SEQUENCES: 9 



Fri Sep 11 07:51:44 1998 US-08-794-795-5 .rnp Page 4 



cc 


CORRESPONDENCE ADDRESS: 








1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 (1 1 1 1 

1 1 ! 1 : 1 1 1 M 1 1 M 1 1 1 M 1 1 1 1 1 1 1 ' 1 1 1 1 1 1 ! 1 1 1 ; M 1 1 1 1 1 i 1 1 1 1 1 1 1 IMIII 




cc 




ADDRESSEE: SmithKline Beecham Corporation 




Qy 


509 


ppppppmippuppmppmpupn. tlpppuppp* tippppppmnmpppit ppupt, mPPuPP \ hp lip 

CGCCGGGACCACCTGCTGAGAAGGGAGCCAAGGGGGCTATGGGACGAGATGGAGCAACAG 


GCO 


cc 




STREET: 709 Svedeland Road 












cc 




CITY: King of Prussia 




Db 


544 


GCCCCTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCC 


503 


cc 




STATE: PA 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

M 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 M M 1 1 ! 1 1 1 1 II 1 M 1 M 1 1 1 1 M 1 II 




cc 




COUNTRY: USA 




Qy 


569 


GCCCCTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCC 




cc 




ZIP: 19406 












cc 


COMPUTER READABLE FORM: 




Db 


604 


AGGGTGCTCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCA 




cc 




MEDIUM TYPE: Diskette 








1 1 1 1 1 1 1 1 \ 1 ( 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

Mill IIIIIMMIMIIIIIIMIIIMIIIilllllllllll IIIIIIMIIIIHI 




cc 




COMPUTER: IBM Compatible 




Qy 


629 


AGGGTGCTCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCA 


coo 
000 


cc 




OPERATING SYSTEM: DOS 












cc 




SOFTWARE: FastSEQ for Windows Version 2.0 




Db 


664 


GCAAAGGCGATGGGGGTCTCATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAG 


Til 

723 


cc 


CURRENT APPLICATION DATA: 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Mill III IIMMIMIMIIIIIMMIIIIIII llllllllllllllllll MINI 




cc 




APPLICATION NUMBER: US/08/794,795 




Qy 


689 


GCAAAGGCGATGGGGGTCTCATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAG 


7 j ft 

74o 


cc 




FILING DATE: 04-FEB-1997 












cc 




CLASSIFICATION: 435 




Db 


724 


GAGACCTGGGTCTCCCAGGAAGCAAAGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCA 


783 


cc 


PRIOR APPLICATION DATA: 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 

MIIIMIMIIIIII IIMIII IIMIIIIIIIMMMIIMIIIIIIIIMMIMI 




cc 




APPLICATION NUMBER: ATG50009P 




Qy 


749 


GAGACCTGGGTCTCCCAGGAAGCAAAGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCA 


808 


cc 




FILING DATE: 22-MAY-1996 










cc 


ATTORNEY/AGENT INFORMATION: 




Db 


784 


TGGGGCCTCCTGGAGCCCAGGGGAGTAAAGGTGACTTCGGGAGGCCAGGCCCACCAGGTT 


843 


cc 




NAME; Han, William T 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Mill MIIIIIIMMIMIIMIMIIIIIIIMIIMIII IIIIIIIIIIMMMI 




cc 




REGISTRATION NUMBER: 34,344 




Qy 


809 


TGGGGCCTCCTGGAGCCCAGGGGAGTAAAGGTGACTTCGGGAGGCCAGGCCCACCAGGTT 


868 


cc 




REFERENCE/DOCKET NUMBER: ATG50009 












cc 


TELECOMMUNICATION INFORMATION: 




Db 


844 


TGGCTGGTTTTCCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGG 


903 


cc 




TELEPHONE: 610-270*5219 








Ml IIMIMI IIHNINI II NMI lliniMIIMMMIIIIiinilll UN 




cc 




TELEFAX: 610-270*4026 




Qy 


869 


TGGCTGGTTTTCCTGGAGCTAAAGGAGATCAAGGACAACCTGGACTGCAGGGTGTTCCGG 


928 


cc 




TELEX: 












cc 


INFORMATION FOR SEQ ID NO: 1: 




Db 


904 


GCCCTCCTGGTGCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCC 


963 


cc 


SEQUENCE CHARACTERISTICS: 








1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 

IIIIIIIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




cc 




LENGTH: 1703 base pairs 




Qy 


929 


GCCCTCCTGGTGCAGTGGGACACCCAGGTGCCAAGGGTGAGCCTGGCAGTGCTGGCTCCC 


988 


cc 




TYPE: nucleic acid 










cc 




STRANDEDNESS: single 




Db 


fie i 
964 


CTGGGCGAGCAGGACTTCCAGGGAGCCCCGGGAGTCCAGGAGCCACAGGCCTGAAAGGAA 




cc 




TOPOLOGY: linear 








1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 

1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 




cc 


MOLECULE TYPE: cDNA 




Qy 




pmpppppupp app tiPrpmppi\ppp\pppppppp \p<ppp?iPP'\PPP'!\PTippppwnn h TiPPnn 

CTGGGCGAGCAGGACTTCCAGGGAGCCXIGGGAGT^^ 


1U40 


SO 


SEQUENCE 1703 BP; 437 A; 423 C; 567 G; 275 T; 1 OTHER, 




















ni-. 

DD 




PPTi A UPPPPTiPTlPTiPPIiPWfTiPTiUPPTiPbPPll fiPPH \ \ H llPPUP JV ntvPurpnpivwrFtLr 

GCAMGGGGACACAGGACTTCAAGGAU\GIMGGAAGAAMG 




Query Match 97.6%; Score 1523; DB 20; Length 1703; 








i i i i i i i i i i r i i i i ii i i i i i ii i i I I i I I I I I I I I I I I I I I I I it I I I I I I I 1 i 1 1 1 1 

IIIIIIIIIIIIIIIMIIIIIIIIMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIII 




Best Local Similarity 99.9%; Pred, No. 0 . 00e+00; 




Qy 


1 f\A ft 

1049 


npy \ hppppi,p\p \ pPuPmrnP?*. \PP\P\PPt\.yiPPn'hPyi.\T\. \PPHP\ MTIP \PPHPff\tVPP 

GCAMGGGG AC ACAGGACTTC AAGGACAGC AAGGAAG AAAAGG AG AATUGbAG 1 1 HAb 




Matches 


1530; Conservative 0; Mismatches 1; Indels 1; Gaps 1; 
















Db 


1084 


nnnnrr\PP\PPmnmP\ uPPPuP* » PuPPPPUPPPP \PPPPfT\PPP\PPn\PPP ft \ POP \PPPP 

GCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGGGCTGGCAGGTCCCAAGGGAGCCC 


1 1 a i 


Db 


4 


ACGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCG 


63 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M II 1 1 1 1 1 1 1 1 1 1 

Ml IMIMIIMMMIlMlMIlM MMMMIIIII llll IIMIIIMIIIMI 








lllllllllllllltlllllllllllllllllllllllllllllllllilllllllllll 




Qy 


1109 


GCCCTGCAGGTGTGAAGGGAGAACAGGGGAGCCCAGGGCTGGCAGGTCCCAAGGGAGCCC 




Qy 


29 


ACGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCG 


88 
















Db 


1144 


CTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAGGATCTTCTGGGGAGCAAGGAG 


lzUi 


Db 


64 


AAATCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGG 


123 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 L 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

M 1 1 1 M N II 1 IN M 1 ! 1 1 1 M 1 1 1 Ml 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 M 1 1 1 1 








II NilllINN IN IN NIMH II II .N NNNI 




Qy 


1169 


CTGGACAAGCTGGCCAGAAGGGAGACCAGGGAGTGAAAGGATCTTCTGGGGAGCAAGGAG 


1228 


Qy 


89 


AAATCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGG 


148 
















Db 


1204 


TAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCCGTCAGGATTGTCGGCAGTA 


1263 


Db 


124 


TCATCTACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATC 


183 






IIMNINII MMI IIMMNIMI II II MIMMNMNI IMMIII III IN 








N MMIMIIM MM IM MIIIMM II II N; MNMN M 




Qy 


1229 


TAAAGGGAGAAAAAGGTGAAAGAGGTGAAAACTCAGTGTCCGTCAGGATTGTCGGCAGTA 


1288 


Qy 


149 


TCATCTACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATC 


208 
















Db 


1264 


GTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGGGACAATTTGCGATGACG 


1323 


Db 


184 


TGCAGGCGCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGG 


243 






M 1 1 1 M 1 ! II 1 1 M N 1 1 1 M M ! 1 1 1 IN Mi 1 M M M 1 1 1 1 III! Ill i 1 1 II 1 M 








M MMMMM II Ml MMMIM M M: II NIMM M 




Qy 


1289 


GTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGGGACAATTTGCGATGACG 


1348 


Qy 


209 


TGCAGGCGCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGG 


268 
















Db 


1324 


AGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGT-ACTCCAAAGGAAGGG 


1382 


Db 


244 


ACAGCCCGTCCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTG 


303 






1 1 1 t 1 1 1 1 i 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 J 1 1 1 1 1 t 1 1 L 1 1 1 1 1 M J 1 1 1 1 1 

II ININ IIIIMI IIMIII NMI II IMIIMIMIIM 1 1 1 1 1 1 1 II 1 1 1 1 1 1 








M M MIMMM II II IIIIMM M M II IMMM M 




Qy 


1349 


AGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTTACTCCAAAGGAAGGG 


1 A no 


Qy 


269 


ACAGCCCGTCCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTG 


328 


















Db 


1383 


CCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAATGTTCAGTGTCGGG 


1442 


Db 


304 


CATCGAGGCTGCAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTCAGCCATGAGCACT 


363 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 IIIIIIIIIIIIIIMII IIMIII IMIIIIII llll llll IMMMIIMIIM II 








I M IMMMM M 1 M MMMM M M M IMMM 




Qy 


1409 


CCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAATGTTCAGTGTCGGG 


1 A CO 


Qy 


329 


CATCGAGGCTGCAAGTCCTGCAGGCCCAACTCACCTGGGTCCGCGTCAGCCATGAGCACT 


388 


















Db 


1443 


GCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCATCATGACTGCAGCC 


1502 


Db 


364 


TGCTGCAGCGGGTAGACAACTTCACTCAGAACCCAGGGATGTTCAGAATCAAAGGTGAAC 


423 






i i i i i i i i i i i i i l i i i i i i i i t i i i i i i r i i i i i 1 i I 1 1 1 I 1 1 1 I M 1 1 1 1 1 1 1 1 1 1 1 1 

Ml Ml MMI INN IMNII MINIMI IIIMIINIMI II IIIIMIIM II 1 








lllllllfllllllllllllllllllllllllllilllllllllllllllllllllllll 




Qy 


1469 


GCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCATCATGACTGCAGCC 


1528 


Qy 


389 


TGCTGCAGCGGGTAGACAACTTCACTCAGAACCCAGGGATGTTCAGAATCAAAGGTGAAC 


448 


















Db 


1503 


ACGAGGAGGACGCAGGCGTGGAGTGCAGCGTC 1534 




Db 


424 


AAGGCGCCCCAGGTCTTCAAGGTCACAAGGGGGCCATGGGCATGCCTGGTGCCCCTGGCC 


483 






1 N NNNI II M N NN 








Mill MlNNN Ni 1 MMMIN N 1 M NINI N 




Qy 


1529 


ACGAGGAGGACGCAGGCGTGGAGTGCAGCGTC 1560 




Qy 


449 


AAGGCGCCCCAGGTCTTCAAGGTCACAAGGGGGCCATGGGCATGCCTGGTGCCCCTGGCC 


508 








Db 


484 


CGCCGGGACCACCTGCTGAGAAGGGAGCCAAGGGGGCTATGGGACGAGATGGAACAACAG 


543 


RESULT 


4 
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ID CJS-09-023-655-813 STANDARD; DNA; UNC; 1323 BP. 

AC xxxxxx 

DT 

DE Sequence 813, Application US/09023655 
CC Sequence 813, Application US/09023655 

CC GENERAL INFORMATION: 

CC APPLICANT: Cocks, Benjamin G. 

CC APPLICANT; Susan G. Stuart 

CC APPLICANT: Jeffrey J. Seilhamer 

CC TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF BLOOD CELL GENE 

CC TITLE OF INVENTION: EXPRESSION 

CC NUMBER OF SEQUENCES: 1508 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

CC STREET: 3174 PORTER DRIVE 

CC CITY; PALO ALTO 

CC STATE: CALIFORNIA 

CC COUNTRY: USA 

CC ZIP: 94304 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS'DQS 

CC SOFTWARE: Word Perfect 6,1 for Windows/MS-DOS 6.2 

CC CURRENT APPLICATION DATA; 

CC APPLICATION NUMBER: US/09/023,655 

CC FILING DATE: HEREWITH 

CC CLASSIFICATION: 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 

CC FILING DATE: 

CC CLASSIFICATION; 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME; Zeller, Karen J. 

CC REGISTRATION NUMBER: 37,071 

CC REFERENCE/DOCKET NUMBER: PA-0001 US 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (650) 855-0555 

CC TELEFAX; (650) 845-4166 

CC INFORMATION FOR SEQ ID NO: 813: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1323 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC IMMEDIATE SOURCE: 

CC LIBRARY: LUNGNOT04 

CC CLONE: 768545 

SQ SEQUENCE 1323 BP; 331 A; 312 C; 471 G; 204 T; 5 OTHER. 



Query Match 65.6%; Score 1023; DB 23; 

Best Local Similarity 98.4%; Pred. No. 0.00e+00; 
1091; Conservative 0; Mismatches 7; 



Matches 


Db 


68 


QY 


460 


Db 


128 


Qy 


520 


Db 


188 


Qy 


580 


Db 


248 


Qy 


640 


Db 


308 


Qy 


700 



Length 1323; 
Indels 11; Gaps 11; 



milllllllllllllll : ' 1 1 1 1 1 1 1 : III iimiimiiiiiiiiiii 



II IMIMMIMI MIIIIIIMI IIIIMMIII IMMMM III 



1 1 1 1 1 1 1 1 i 1 1 1 1 1 ! M I i 11 1 1 1 1 1 i 1 1 1 1 1 1 1 1 f I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M I 



MMMM MMMIIM! MMIMMMII IMMMi MINI I 'M 



MIMIII IMIIIIIIM' MIIMMIIMI MIIMM MMMM M 



Db 


368 


Qy 


760 


Db 


428 


Qy 


820 


Db 


486 


Qy 


880 


Db 


546 


Qy 


939 


Db 


606 


Qy 


999 


Db 


666 


Qy 


1058 


Db 


726 


Qy 


1118 


Db 


786 


Qy 


1178 


Db 


846 


Qy 


1238 


Db 


906 


Qy 


1298 


Db 


966 


Qy 


1354 


Db 


1026 


Qy 


1412 


Db 


1086 


Qy 


1471 


Db 


1146 


Qy 


1531 



mmiiiminiinniiiiinniMniniinniiniinninn 



i-i i, in ii ii in iiiiiiiiiiiiiiiiiiiiimiiiiimi 



Mlllllllllllllllllllillllllllll IIIIIIIMIIMIIIIIlllllllll 



lll!lllllllllllllllllllMlllilllllll!IIIIIIIIIIIMI!llllllll 



MM II Mh Mill II IN I, 111 M IMII I'M 



iiiiiiiiiiiiiiiiiiiiiiiiiiMiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



lllllllllllllllllllllllllllllllllllllllllillllllllllllllllll 



llllllllllllllllilllllllllllllllllllllllllllllllllllllllllll 



MIM II MM Mill II MM M MM MMMM MM IM 



GGCCGGGGCTGAAGTTTACTACAGTGGTACCTGGGGGACAAATTTGCGATGACGGAGTGG 965 

i inn iiiiiiijiiiiiiiiiiiiiiiiiiiiiiiii illinium inn 



i i ii i i ii iim n niiiiiiiinnii nniinnimii 



i i m ri ii iniiiiniiiiiiiiiiiiiiiiiiiiiiiiiniiii 



i inniiiiiiiniiniiiiiiiiiiniiiiiiiiiiiiminiinniiii 



Ml II III II MM IN 



RESULT 
ID 
AC 
DT 
DE 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



US-08-837-312-1335 STANDARD; 
xxxxxx 



DNA; UNC; 775 BP. 



Sequence 1335, Application US/08837312 
Sequence 1335, Application US/08837312 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Jacobs, Kenneth 
McCoy, John 
LaVallie, Edward 
Racie, Lisa 
Merberg, David 
Treacy, Maurice 
Spaulding, Vikki 



TITLE OF INVENTION: SECRETED, EXPRESSED SEQUENCE TAGS 



Fri Sep 11 07:51:44 1998 



US-08-794-795-5.rnp 



Page 6 



CC NUMBER OF SEQUENCES: 1519 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Genetics Institute, Inc. 

CC STREET: 87 CambridgePark Drive 

CC CITY: Cambridge 

CC STATE: Massachusetts 

CC COUNTRY: U.S. A 

CC ZIP: 02140 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy Disk 

CC COMPUTER: IBM PC Compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/837,312 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Brown, Scott A. 

CC REGISTRATION NUMBER: 32,724 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (617) 498-8224 

CC TELEFAX: (617) 876-5851 

CC INFORMATION FOR SEQ ID NO; 1335; 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH; 775 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: double 

CC TOPOLOGY; linear 

CC MOLECULE TYPE: cDNA 

SQ SEQUENCE 775 BP; 184 A; 221 C; 244 G; 126 T; 0 OTHER. 



Query Match 47,6%; 
Best Local Similarity 100.0%; 
742; Conservative 



Matches 


Db 


22 


Qy 


33 


Db 


82 


Qy 


93 


Db 


142 


Qy 


153 


Db 


202 


Qy 


213 


Db 


262 


Qy 


273 


Db 


322 


Qy 


333 


Db 


382 


Qy 


393 


Db 


442 


Qy 


453 


Db 


502 


Qy 


513 


Db 


562 



Score 742; DB 21; Length 775; 
Pred. No. 0.00e+00; 
0; Mismatches 0; Indels 0; 



Gaps 0; 



llllllllllllllllllllllllimilllllllllllllllllllllllllllllll 



CAATGTTCCAMGCCCMGAGGAGAMTGGGGTGAACTTCTCCCTAGCTGTGGTGGTCAT 141 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimmiiiiiiiiiiiiMiim 



MIIIIM II1IIII! IIUIMII MUM IIIIIIIIIMIIIM 



lllllllllllllllllllllllllllllllllllllllllllllllllllllllllll! 



iiiimiiiiiiiiiiiiiimiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



iiiiimiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiimiiiiiiiiiii 



iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii 



IMIlM: .1.1.1 MINI ,IM.I U'.IIIU I I I 



IIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



562 CTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGG 621 



llllllllllllllllllllllllllllllimilllllllllllllllllllllllll 



Qy 


573 


CTCGGGACCCCAAGGCCCACCGGGAGTCAAGGGAGAGGCGGGCCTCCAAGGACCCCAGGG 


632 


Db 


622 


TGCTCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAA 


681 






1 1 ! 1 1 1 1 ! 1 1 ! 1 1 II 1 1 1 1 1 I 1 II 1 1 ! 1 1 If 1 1 It 1 M 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 II M 1 1 M 1 1 1 




Qy 


633 


TGCTCCAGGGAAGCAAGGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAA 


692 


Db 


682 


AGGCGATGGGGGTCTCATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGA 


741 






IIIIIIIIMIIIIMIIIIIIIIIIIillllllMIIMIIIIMIllllMIIIMI! 




Qy 


693 


AGGCGATGGGGGTCTCATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGA 


752 


Db 


742 


CCTGGGTCTCCCAGGAAGCAAA 763 








MIIIIIIIIIIMIIIIIIII 




Qy 


753 


CCTGGGTCTCCCAGGAAGCAAA 774 





RESULT 
ID 
AC 
DT 
DE 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
SQ 



US-08*893-467A-l STANDARD; 
xxxxxx 



DNA; UNC; 1868 BP. 



An Insolated DNA Sequence For a 
Novel Macrophage Receptor with 
a Collagenous Domain and the 
Polypeptide Chain Encoded by 
such a Sequence 
2 



Sequence 1, Application US/08893467A 
Sequence 1, Application US/08893467A 
GENERAL INFORMATION: 
APPLICANT; Tryggvason, Karl 
APPLICANT: Elomaa, Outi 
APPLICANT: Kangas, Maarit 
TITLE OF INVENTION: 
TITLE OF INVENTION: 
TITLE OF INVENTION: 
TITLE OF INVENTION: 
TITLE OF INVENTION: 
NUMBER OF SEQUENCES 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Fay, Sharpe, Beall, Fagan, 
ADDRESSEE: Minnich & McKee 
STREET; 1100 Superior Avenue 
STREET: Suite 700 
CITY: Cleveland 
STATE: Ohio 
COUNTRY: U.S.A. 
ZIP: 44114-2518 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 
MEDIUM TYPE: 720 Kb storable 
COMPUTER: IBM PS/2, Model 35 SX 
OPERATING SYSTEM: DOS 5.0 
SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 
APPLICATION NUMBER: US/08/893, 467A 
FILING DATE; 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Minnich, Richard J. 
REGISTRATION NUMBER: 24,175 
REFERENCE/DOCKET NUMBER: TRV 2 009 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (216) 861-5582 
TELEFAX: (216) 241-1666 
TELEX ; (216) 980162 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS; 
LENGTH; 1868 base pairs 
TYPE; Nucleic acid 
STRANDEDNESS: Single 
TOPOLOGY: Linear 
MOLECULE TYPE: Nucleotide-genomic DNA 
HYPOTHETICAL: Not relevant 
ANTI -SENSE; Not relevant 
SEQUENCE 1868 BP; 524 A; 433 C; 576 G; 335 T; 0 OTHER. 



Query Match 42.7%; 
Best Local Similarity 75.1%; 
Matches 1110; Conservative 



Score 666; DB 21; Length 1868; 
Pred. No. 0.00e+00; 

0; Mismatches 354; Indels 15; Gaps 6; 
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Db 


247 


Qy 


76 


Db 


307 


Qy 


136 


Db 


367 


Qy 


196 


Db 


427 


Qy 


256 


Db 


481 


Qy 


316 


Db 


541 


Qy 


376 


Db 


601 


Qy 


436 


Db 


652 


Qy 


496 


Db 


712 


Qy 


556 


Db 


772 


Qy 


616 


Db 


832 


Qy 


676 


Db 


892 


Qy 


736 


Db 


952 


Qy 


796 


Db 


1012 


Qy 


856 


Db 


1072 


Qy 


916 


Db 


1132 


Qy 


976 


Db 


1192 


Qy 


1036 


Db 


1252 


Qy 


1096 



nun i iiniiiiiiiii mi iiiiiiii iiiiniiiii ii inn i 



i n ii in mi iiiiiiiiiiiiini ii n in nun n i 



iiiiiiii minim ii urn i i n inn i inn 



ii ii mini ii i linn iiiimm mi n 1 1 m 



MM I I Ml I I Mill I Ill MM IIIIIIII 



Mill Mill 



mm i i him inn ii mi i urn 



M MIMIIMI MM I IMIMI Ml 



•AAGGGGCCCCGGGTGCT-CCT 651 

MIMI II II I III 



m mi ii I ill ii ill nun inn i inn m n 



mm i mi i iiiiiiii imiimi m mimmmim mi 



mil mi Mini milium hi inn i iiiiiiii i i 



mi Milium . ii iimii mm 11 in hi m 



'I'll Mill Mill II II IMIMI IMIIIIIIM lllllllll III 



II I Ml IIIMII I IIIIIIII HIM MIMIIMI II I III 



MM IMMMIMMMI I III MM Mill M IIMII IIIIMM II 



IIIMII IMIMI IIMII MM It 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M M 



I MM IIIMII IIIIMM II II I I MM IIMII 



II M 1 1 1 1 1 1 M 1 1 1 II lllllllll IHIIIIIIM MM I IMIIIMM 



IIIIMM Mill I II Ml IIIIIIII II Mill mini I 





1312 


Ov 


1156 




1379 


Ov 

uy 




nh 




Ov 


1276 


nh 

Uu 


14Q9 


Ov 

uy 


1336 


nh 

uu 


1^9 


Qy 


1396 


Db 


1612 


Qy 


1456 


Db 


1672 


Qy 


1516 




lllll Mil IIIMII II II MM M III IIIMII MIMI 



II IMIIMI IIIIIIMMII Ml III III III I MM II 



II Ml I I III IIIMII Ml I M I I IMIIMM 



Mill lllll II Ml I Ml IIMII I lllllllll MM MIMI 



MM III II II M II MM MM Mill I MM! Ml 



M I MIMIIMM MM I lllll lllll MM IIIMII II 



II I III II I II II II II Mill III 



RESULT 7 

ID US-09-057-719-358 STANDARD; DNA; UNC; 311 BP. 

AC xxxxxx 
DT 

DE Sequence 358, Application US/09057719 

CC Sequence 358, Application US/09057719 

CC GENERAL INFORMATION: 

CC APPLICANT; Edwards, Jean-Baptiste Dumas Milne 

CC APPLICANT: Duclert, Aymeric 

CC TITLE OF INVENTION: EXPRESSED SEQUENCE TAGS FOR SECRETED 

CC TITLE OF INVENTION: PROTEINS 

CC NUMBER OF SEQUENCES: 1207 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Knobbe, Martens, Olson & Bear 

CC STREET: 501 West Broadway 

CC CITY: San Diego 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: 92101-3505 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy Disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: Win95 

CC SOFTWARE: Word 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/09/057,719 

CC FILING DATE: 

CC CLASSIFICATION: 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Israelsen, Ned A. 

CC REGISTRATION NUMBER: 29,555 

CC REFERENCE/DOCKET NUMBER: GENSET.025A 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (619) 235-8550 

CC TELEFAX: (619) 235-0176 

CC INFORMATION FOR SEQ ID NO: 358: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 311 base pairs 

CC TYPE: NUCLEIC ACID 

CC STRANDEDNESS: DOUBLE 

CC TOPOLOGY: LINEAR 

CC MOLECULE TYPE: CDNA 

CC ORIGINAL SOURCE: 

CC ORGANISM: Homo Sapiens 
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CC TISSUE TYPE; Liver 

CC FEATURE; 

CC NAME/KEY: sig_peptide 

CC LOCATION: 62.. 172 

CC IDENTIFICATION METHOD: Von Heijne matrix 

CC OTHER INFORMATION: score 5.6 

CC OTHER INFORMATION: seq VIYLILLTAGAGL/LV 

SO SEQUENCE 311 BP; 65 A; 87 C; 85 G; 73 T; 1 OTHER, 



Query Match 19.8%; 
Best Local Similarity 99.4%; 
Matches 309; Conservative 



Score 309; DB 24; Length 311; 
Pred. No. 2.32e-284; 

1; Mismatches 1; Indels 0; Gaps 0; 



Db 



Qy 

Db 

Qy 



Db 



1 AATTCTCAAGGAGGACGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGC 60 

I Ml MMI Ml Mill Ml Mill II II MM MMM 
15 AATTCTCAAGGAGGACGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATTGC 74 

61 AATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCAAGAGGAGAMATGGGGTGAACTTCTC 120 

M ; M IMMM 'M MMI Ml Mill M - MM I M'M 
75 AATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTCTC 134 



121 CCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGT 180 
M MMMM, M MM II MIM I I MM I M'M 
Qy 135 CCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTGGT 194 

Db 181 CCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACAC 240 

IM MMMMM M, MM, II MIM M II MM M Ml 
Qy 195 CCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGACAC 254 

Db 241 TCTGGCGGCTGAGGACAGCCCGTCCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACA 300 

M Ml MMI Ml Mill IM MMM M M MM M MM 
Qy 255 TCTGGCGGCTGAGGACAGCCCGTCCTTCTCCTTGCTGCAGTCAGCACACCCTGGAGAACA 314 

Db 301 CCTGGCTCGGG 311 

IM I II 
Qy 315 CCTGGCTCAGG 325 



RESULT 8 

ID US-08-801-504-148 STANDARD; DNA; UNC; 269 BP. 
AC xxxxxx 

DT 
DE 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



Sequence 148. Application DS/08801504 
Sequence 148, Application US/08801504 
GENERAL INFORMATION: 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 

APPLICANT 



Stuart, Susan G. 
Gooding, Douglas H. 
John C. Lane 
Angelo M, Delegeane 
James I. Snable 

TITLE OF INVENTION: POLYNUCLEOTIDES AND POLYPEPTIDES DERIVED FROM 
TITLE OF INVENTION: OSTEOARTHRITIC SYNOVIUM 
NUMBER OF SEQUENCES: 2850 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 
STREET: 3174 PORTER DRIVE 
CITY: PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Word Perfect 6,1 for Windows/MS-DOS 6.2 
CURRENT APPLICATION DATA: 
APPLICATION NUMBER: US/08/801,504 
FILING DATE: HEREWITH 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 
APPLICATION NUMBER: 60/009,710 
FILING DATE: DECEMBER 19, 1995 
PRIOR APPLICATION DATA: 



CC APPLICATION NUMBER: 60/010,009 

CC FILING DATE: January 4, 1996 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: CERRONE, MICHAEL C. 

CC REGISTRATION NUMBER: 39,132 

CC REFERENCE/DOCKET NUMBER: PD-0099 US 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 855-0555 

CC TELEFAX: (415) 845-4166 

CC INFORMATION FOR SEQ ID NO: 148: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 269 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: cDNA 

CC IMMEDIATE SOURCE: 

CC CLONE: 722411 

SQ SEQUENCE 269 BP; 48 A; 82 C; 80 G; 59 T; 0 OTHER. 



Query Match 17.2%; 
Best Local Similarity 100,0%; 
Matches 269; Conservative 



Score 269; DB 21; Length 269; 
Pred. No, 4.20e-242; 

0; Mismatches 0; Indels 0; Gaps 0; 



Db 



Qy 



Db 



1 AAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATC 60 
llllllllllllllllllllllllllllllllllllltllllllllllllllllllllll 
103 AAGCCCAAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATC 162 



61 CTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGCGGCTC 120 
M I IM I I Ml I Mill M M MIM M MM 
Qy 163 CTGCTCACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGCGGCTC 222 



Db 



Qy 

Db 



121 CGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGTCCTTC 180 

IM M MM I! M MM I! MMMM II MIM M 1 MMM 
223 CGGGTCCTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGTCCTTC 282 



181 TCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTGCAA 240 
IM M MM II M MM II MMMM II MIM IM IMM : 
Qy 283 TCCTTGCTGCAGTCAGCACACCCTGGAGAACACCTGGCTCAGGGTGCATCGAGGCTGCAA 342 

Db 241 GTCCTGCAGGCCCAACTCACCTGGGTCCG 269 

M M MM II II MM M 
Qy 343 GTCCTGCAGGCCCAACTCACCTGGGTCCG 371 



Sequence 1236, Application US/60039051 
Sequence 1236, Application US/60039051 
GENERAL INFORMATION: 



RESULT 9 

ID US-60-039-051-1236 STANDARD; 
AC XXXXXX 
DT 
DE 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



DNA; UNC; 252 BP. 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gooding, Douglas H, 
Stuve, Laura L. 
Stuart, Susan G. 
I to, Laura Y. 
Akerblom, Ingrid E. 
Delegeane, Angelo M. 
Naughton, Rebecca E. 
Klingler, Tod M. 

TITLE OF INVENTION: POLYNUCLEOTIDES AND POLYPEPTIDES DERIVED FROM 
TITLE OF INVENTION: HUMAN LUNG 
NUMBER OF SEQUENCES: 1811 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 
STREET: 3174 PORTER DRIVE 
CITY: PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
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CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/60/039,051 

CC FILING DATE: 

CC CLASSIFICATION: 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: CERRONE, MICHAEL C. 

CC REGISTRATION NUMBER: 39,132 

CC REFERENCE/DOCKET NUMBER: PD-0328P 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 855-0555 

CC TELEFAX: (415) 845-4166 

CC INFORMATION FOR SEQ ID NO: 1236: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 252 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: cDNA 

CC IMMEDIATE SOURCE: 

CC CLONE: 2591776H1 

SQ SEQUENCE 252 BP; 70 A; 54 C; 96 G; 32 T; 0 OTHER, 

Query Match 16.2%; Score 252; DB 5? Length 252; 

Best Local Similarity 100.0%; Pred. No. 3.24e-224; 

Matches 252; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Db 1 GGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAAAGGCGATGGGGGTCTC 60 

IlilMlllllllllllllllllllllllllllllllllllllllllllljllMIIIII 
Qy 649 GGAGCCACTGGCACCCCAGGACCCCAAGGAGAGAAGGGCAGCAAAGGCGATGGGGGTCTC 708 

Db 61 ATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGACCTGGGTCTCCCAGGA 120 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIII 
Qy 709 ATTGGCCCAAAAGGGGAAACTGGAACTAAGGGAGAGAAAGGAGACCTGGGTCTCCCAGGA 768 

Db 121 AGCAAAGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATGGGGCCTCCTGGAGCCCAG 180 

M IM III M MMM IMM MMI MM MM. IM MM 
Qy 769 AGCAAAGGGGACAGGGGCATGAAAGGAGATGCAGGGGTCATGGGGCCTCCTGGAGCCCAG 828 

Db 181 GGGAGTAAAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCT 240 

MMM MIMM MIMM IMM IMM MM 1 MMIIMI MIM 
Qy 829 GGGAGTAAAGGTGACTTCGGGAGGCCAGGCCCACCAGGTTTGGCTGGTTTTCCTGGAGCT 888 

Db 241 AAAGGAGATCAA 252 

illinium 

Qy 889 AAAGGAGATCAA 900 



RESULT 10 

ID US-08-808-904-5073 STANDARD; DNA; UNC; 250 BP. 

AC xxxxxx 

DT 

DE Sequence 5073, Application US/08808904 

CC Sequence 5073, Application US/08808904 

CC GENERAL INFORMATION: 

CC APPLICANT: Gooding, Douglas H. 

CC APPLICANT: Stuve, Laura L. 

CC APPLICANT: Stuart, Susan G. 

CC APPLICANT; Ito, Laura Y. 

CC APPLICANT: Akerblom, Ingrid E. 

CC APPLICANT: Delegeane, Angelo M. 

CC APPLICANT: Naughton, Rebecca E. 

CC APPLICANT: Klingler, Tod M. 

CC TITLE OF INVENTION: POLYNUCLEOTIDES AND POLYPEPTIDES DERIVED FROM 

CC TITLE OF INVENTION: ASTHMATIC LUNG 

CC NUMBER OF SEQUENCES: 5094 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

CC STREET: 3174 PORTER DRIVE 

CC CITY: PALO ALTO 

CC STATE: CALIFORNIA 

CC COUNTRY: USA 



CC 


ZIP: 94304 


CC 


COMPUTER READABLE FORM: 


CC 


MEDIUM TYPE: Floppy disk 


CC 


COMPUTER: IBM PC compatible 


CC 


OPERATING SYSTEM: PC -DOS/MS -DOS 


CC 


SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 


CC 


CURRENT APPLICATION DATA: 


CC 


APPLICATION NUMBER: US/08/808, 204 


CC 


FILING DATE: HEREWITH 


CC 


CLASSIFICATION: 536 


CC 


PRIOR APPLICATION DATA: 


CC 


APPLICATION NUMBER: 60/012,698 


CC 


FILING DATE: FEBRUARY 29, 1996 


CC 


CLASSIFICATION: 536 


CC 


PRIOR APPLICATION DATA: 


CC 


APPLICATION NUMBER; 60/013,365 


CC 


FILING DATE: MARCH 13, 1996 


CC 


CLASSIFICATION: 536 


CC 


PRIOR APPLICATION DATA: 


CC 


APPLICATION NUMBER: 60/027,122 


CC 


FILING DATE: SEPTEMBER 27, 1996 


CC 


CLASSIFICATION: 536 


CC 


ATTORNEY/AGENT INFORMATION: 


CC 


NAME: CERRONE, MICHAEL C. 


CC 


REGISTRATION NUMBER: 39,132 


CC 


REFERENCE/DOCKET NUMBER: PD-0120 US 


CC 


TELECOMMUNICATION INFORMATION: 


CC 


TELEPHONE: (415) 855-0555 


CC 


TELEFAX: (415) 845*4166 


CC 


INFORMATION FOR SEQ ID NO: 5073: 


CC 


SEQUENCE CHARACTERISTICS: 


CC 


LENGTH: 250 base pairs 


CC 


TYPE: nucleic acid 


CC 


STRANDEDNESS: single 


CC 


TOPOLOGY: linear 


CC 


MOLECULE TYPE: cDNA 


CC 


IMMEDIATE SOURCE: 


CC 


CLONE: 1989038 



SQ SEQUENCE 250 BP; 58 A; 54 C; 82 G; 56 T; 0 OTHER. 

Query Match 16.0%; Score 250; DB 21; Length 250; 

Best Local Similarity 100.0%; Pred. No. 4 . 10e-222; 

Matches 250; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Db 1 ATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGGGACA 60 

IM MM II Ml MM I IMM MM MMI M IMM M 
Qy 1276 ATTGTCGGCAGTAGTAACCGAGGCCGGGCTGAAGTTTACTACAGTGGTACCTGGGGGACA 1335 

Db 61 ATTTGCGATGACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTTAC 120 

II! MM II MM 1 MM MIMM MM MM' M MM IM 
Qy 1336 ATTTGCGATGACGAGTGGCAAAATTCTGATGCCATTGTCTTCTGCCGCATGCTGGGTTAC 1395 

Db 121 TCCAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 180 

I II" M 'Ml M I ' M I IM I IMM MM M 
Qy 1396 TCCAAAGGAAGGGCCCTGTACAAAGTGGGAGCTGGCACTGGGCAGATCTGGCTGGATAAT 1455 

Db 181 GTTCAGTGTCGGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCAT 240 

M^ MM II MM MM II MM MM MMMM MIM MM 
Qy 1456 GTTCAGTGTCGGGGCACGGAGAGTACCCTGTGGAGCTGCACCAAGAATAGCTGGGGCCAT 1515 

Db 241 CATGACTGCA 250 

MMM 
Qy 1516 CATGACTGCA 1525 



RESULT 11 

ID US-08-770-384-3424 STANDARD; DNA; UNC; 251 BP. 

AC xxxxxx 

DT 

DE Sequence 3424, Application US/08770384 

CC Sequence 3424, Application US/08770384 
CC GENERAL INFORMATION: 
CC APPLICANT: Stuart, Susan G. 
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CC APPLICANT: Gooding, Douglas H. 

CC APPLICANT: Lane, John C, 

CC APPLICANT: Delegeane, Angelo M, 

CC APPLICANT: Snable, John L. 

CC TITLE OF INVENTION: POLYNUCLEOTIDES AND POLYPEPTIDES DERIVED FROM 

CC TITLE OF INVENTION: RHEUMATOID SYNOVIUM 

CC NUMBER OF SEQUENCES: 5501 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: INCYTE PHARMACEUTICALS, INC, 

CC STREET: 3174 PORTER DRIVE 

CC CITY; PALO ALTO 

CC STATE: CALIFORNIA 

CC COUNTRY: USA 

CC ZIP: 94304 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Word Perfect 5.1 for Windows/MS-DOS 6.2 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/770,384 

CC FILING DATE: HEREWITH 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 60/009,486 

CC FILING DATE: December 15, 1995 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 60/009,132 

CC FILING DATE: DECEMBER 22, 1995 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME; CERRONE, MICHAEL C. 

CC REGISTRATION NUMBER: 39,132 

CC REFERENCE/DOCKET NUMBER; PD-0096 US 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE; (415) 855-0555 

CC TELEFAX; (415) 845-4166 

CC INFORMATION FOR SEQ ID NO; 3424: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 251 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY; linear 

CC MOLECULE TYPE: CDNA 

CC IMMEDIATE SOURCE: 

CC CLONE: 705645 

SQ SEQUENCE 251 BP; 48 A; 69 C; 67 6; 63 T; 4 OTHER. 

Query Match 15.8%; Score 247; DB 20; Length 251; 

Best Local Similarity 98.4%; Pred. No, 5,79e-219; 

Matches 247; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 

Db 1 GCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCAAGAGG 60 

IMP MM II ll!l Mill !l MM 1 M Mil MM I 

Qy 55 GCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCAAGAGG 114 

Db 61 AGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCACCGCT 120 

IMI Mill MMIM MM M MM M MM MMI 

Qy 115 AGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCACCGCT 174 

Db 121 GGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGNGGCTCCGGGTCCTGGAG 180 

MMMM MIIM iM MIM MM M III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Qy 175 GGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCCTGGAG 234 

Db 181 ATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGTCCTTCTNCTTGCTGCAG 240 

I M! M'M M MM MMIM MM: M M MM 

Qy 235 ATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGTCCTTCTCCTTGCTGCAG 294 

Db 241 TCAGNACANCC 251 

INI III II 

Qy 295 TCAGCACACCC 305 



RESULT 12 



ID US-09-016-434-839 STANDARD; DNA; UNC; 246 BP. 

AC xxxxxx 

DT 

DE Sequence 839, Application US/09016434 

CC Sequence 839, Application US/09016434 

CC GENERAL INFORMATION: 

CC APPLICANT; Janice Au-Young 

CC APPLICANT: Jeffrey J. Seilhamer 

CC TITLE OF INVENTION: COMPOSITION FOR THE DETECTION OF SIGNALING 

CC TITLE OF INVENTION; PATHWAY GENE EXPRESSION 

CC NUMBER OF SEQUENCES: 1490 

CC CORRESPONDENCE ADDRESS; 

CC ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

CC STREET: 3174 PORTER DRIVE 

CC CITY: PALO ALTO 

CC STATE: CALIFORNIA 

CC COUNTRY: USA 

CC ZIP; 94304 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/09/016,434 

CC FILING DATE; HEREWITH 

CC CLASSIFICATION: 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: 

CC FILING DATE: 

CC CLASSIFICATION: 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME; Zeller, Karen J. 

CC REGISTRATION NUMBER: 37,071 

CC REFERENCE/DOCKET NUMBER: PA-0002 US 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (650) 855-0555 

CC TELEFAX: (650) 845-4166 

CC INFORMATION FOR SEQ ID NO: 839; 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 246 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS; single 

CC TOPOLOGY: linear 

CC IMMEDIATE SOURCE: 

CC LIBRARY: MMLR3DT01 

CC CLONE: 569648 

SQ SEQUENCE 246 BP; 69 A; 44 C; 82 G; 49 T; 2 OTHER. 

Query Match 15.5%; Score 242; DB 23; Length 246; 

Best Local Similarity 98.8%; Pred. No. 1.02e-213; 

Matches 243; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

Db 1 GGGAGACCAGGGAGTGAAAGGATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGA 60 

Mi I MM' n MM M MM llll! M MM M MIM 

Qy 1188 GGGAGACCAGGGAGTGAAAGGATCTTCTGGGGAGCAAGGAGTAAAGGGAGAAAAAGGTGA 1247 

Db 61 AAGAGGTGAAAACTCAGTGTCCGTCAAGATTNTCGGCAGTAGTAACCGAGGCCGGGCTGA 120 

llllllllllllllllllllllllll IMI 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 

Qy 1248 AAGAGGTGAAAACTCAGTGTCCGTCAGGATTGTCGGCAGTAGTAACCGAGGCCGGGCTGA 1307 

Db 121 AGTTTACTACAGTGGTACCTGGGGGACAATTTGCGATGACGAGTGGCAAAATTCTGATGC 180 

MM MM I MM I I I M iMM MM M MM 1 II 

Qy 1308 AGTTTACTACAGTGGTACCTGGGGGACAATTTGCGATGACGAGTGGCAAAATTCTGATGC 1367 

Db 181 CATTGTCTTCTGCCGCATGCTGGGTTACTCCAAAGGAAGGGCCCTGTACAAAGTGGGAGC 240 

MIIM MMIM IM. II MM. II MM; Mill II MiM Ml 

Qy 1368 CATTGTCTTCTGCCGCATGCTGGGTTACTCCAAAGGAAGGGCCCTGTACAAAGTGGGAGC 1427 

Db 241 TNGCAC 246 

I MM 

Qy 1428 TGGCAC 1433 
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Sequence 163, Application US/60051749 
Sequence 163, Application US/60051749 
GENERAL INFORMATION: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 



RESULT 13 

ID US-60-051-749-163 STANDARD; 
AC xxxxxx 
DT 
DE 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 



DNA; UNC; 241 BP. 



CC 
CC 
SQ 



Gooding, Douglas H. 
Stuart, Susan G. 
Ito, Laura Y, 
Naughton, Rebecca E. 
TITLE OF INVENTION: POLYNUCLEOTIDES AND POLYPEPTIDES DERIVED FROM 
TITLE OF INVENTION: HUMAN LUNG 
NUMBER OF SEQUENCES: 3617 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 
STREET: 3174 PORTER DRIVE 
CITY: PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94304 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 
CURRENT APPLICATION DATA: 
APPLICATION NUMBER: US/60/051,749 
FILING DATE: HEREWITH 
CLASSIFICATION: 
ATTORNEY/AGENT INFORMATION: 
NAME: CERRONE, MICHAEL C, 
REGISTRATION NUMBER: 39,132 
REFERENCE/DOCKET NUMBER; PD-0388P 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 855-0555 
TELEFAX: (415) 845-4166 
INFORMATION FOR SEQ ID NO: 163: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 241 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: Single 
TOPOLOGY: linear 
MOLECULE TYPE: CDNA 
IMMEDIATE SOURCE; 
CLONE: 2684337H1 
SEQUENCE 241 BP; 57 A; 61 C; 64 G; 59 T; 0 OTHER. 



Query Match 15.4%; Score 241; DB 7; Length 241; 

Best Local Similarity 100.0%; Pred. No. 1.15e*212; 

Matches 241; Conservative 0; Mismatches 0; Indels 0; 



Gaps 0; 



Db 


l 


AAAATTCTCAAGGAGGACGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATT 


60 






III Ml MMIIMMi IM Ml IMIIMM IM II II Ml 




oy 


13 


AAAATTCTCAAGGAGGACGAGCTCTTGAGTGAGACCCAACAAGCTGCTTTTCACCAAATT 


72 


Db 


61 


GCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTC 


120 






1 MM MMMMIIM MM III IMIMM II il il MM 




Qy 


73 


GCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCAAGAGGAGAAATGGGGTGAACTTC 


132 


Db 


121 


TCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTG 


180 






M MM 'MMMMM II M MMMM' Ml II M MM 




Qy 


133 


TCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCACCGCTGGCGCTGGGCTGCTGGTG 


192 


Db 


181 


GTCCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGAC 


240 






M M MMMMIM IM III IMIMM 1! M il IM 




Qy 


193 


GTCCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCCTGGAGATGTATTTCCTCAATGAC 


252 


Db 


241 


A 241 




Qy 


253 


1 

A 253 





APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Gooding, Douglas H. 
Stuve, Laura L. 
Stuart, Susan G. 
Ito, Laura Y. 
Akerblom, Ingrid E. 
Delegeane, Angelo M. 
Naughton, Rebecca E. 
Klingler, Tod M. 

C TITLE OF INVENTION; POLYNUCLEOTIDES AND POLYPEPTIDES DERIVED FROM 

C TITLE OF INVENTION: HUMAN LUNG 

C NUMBER OF SEQUENCES: 3314 

C CORRESPONDENCE ADDRESS: 

C ADDRESSEE : INCYTE PHARMACEUTICALS, INC. 

C STREET: 3174 PORTER DRIVE 

C CITY: PALO ALTO 

C STATE: CALIFORNIA 

C COUNTRY; USA 

C ZIP; 94304 

C COMPUTER READABLE FORM; 

C MEDIUM TYPE: Floppy disk 

C COMPUTER: IBM PC compatible 

C OPERATING SYSTEM; PC-DOS/MS-DOS 

C SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 

C CURRENT APPLICATION DATA: 

C APPLICATION NUMBER: US/08/810,326 

C FILING DATE: HEREWITH 

C CLASSIFICATION; 435 

C PRIOR APPLICATION DATA; 

C APPLICATION NUMBER: 60/012,699 

C FILING DATE; FEBRUARY 29, 1996 

C CLASSIFICATION: 435 

C PRIOR APPLICATION DATA: 

C APPLICATION NUMBER: 60/015,173 

C FILING DATE: APRIL 10, 1996 

C CLASSIFICATION: 435 

C ATTORNEY/AGENT INFORMATION: 

C NAME; CERRONE, MICHAEL C. 

C REGISTRATION NUMBER: 39,132 

C REFERENCE/DOCKET NUMBER: PD-Q121 US 

C TELECOMMUNICATION INFORMATION: 

C TELEPHONE: (415) 855-0555 

C TELEFAX: (415) 845-4166 

C INFORMATION FOR SEQ ID NO; 2282: 

C SEQUENCE CHARACTERISTICS; 

C LENGTH: 242 base pairs 

C TYPE: nucleic acid 

C STRANDEDNESS: single 

C TOPOLOGY: linear 

C MOLECULE TYPE: CDNA 

C IMMEDIATE SOURCE: 

C CLONE: 766295 

Q SEQUENCE 242 BP; 48 A; 66 C; 64 G; 63 T; 1 OTHER, 

Query Match 15.3%; Score 239; DB 21; Length 242; 

Best Local Similarity 99,2%; Pred. No. 1.43e-210; 

Matches 240; Conservative 0; Mismatches 2; Indels 0; Gaps 0 



Db 1 CAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCC 60 

MMM M M M MMMM | ! MMMM M M MM 

Qy 49 CAACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCC 108 

Db 61 AAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTC 120 

MMM M :I -M-M M M MMMM' M M MM 

Qy 109 AAGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTC 168 
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Db 121 ACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTC 180 

IIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIlllMllllllllllllllllillMII 
Qy 169 ACCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTC 228 

Db 181 CTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGTCCTTNTTCTTG 240 

IIIIIIIIIIIIIIIMIilllillllllllllllMIIIIIIIIIIMIIII I MM 
Qy 229 CTGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGTCCTTCTCCTTG 288 

Db 241 CT 242 

It 

Qy 289 CT 290 



RESULT 15 

ID US-08-770-384-3302 STANDARD; DNA; UNC; 228 BP. 

AC xxxxxx 

DT 

DE Sequence 3302, Application US/08770384 

CC Sequence 3302, Application US/08770384 

CC GENERAL INFORMATION; 

CC APPLICANT: Stuart, Susan G. 

CC APPLICANT; Gooding, Douglas H, 

CC APPLICANT: Lane, John C. 

CC APPLICANT; Delegeane, Angelo M, 

CC APPLICANT: Snable, John L. 

CC TITLE OF INVENTION; POLYNUCLEOTIDES AND POLYPEPTIDES DERIVED FROM 

CC TITLE OF INVENTION: RHEUMATOID SYNOVIUM 

CC NUMBER OF SEQUENCES; 5501 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: INCYTE PHARMACEUTICALS, INC. 

CC STREET: 3174 PORTER DRIVE 

CC CITY: PALO ALTO 

CC STATE: CALIFORNIA 

CC COUNTRY: USA 

CC ZIP: 94304 

CC COMPUTER READABLE FORM; 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Word Perfect 6.1 for Windows/MS-DOS 6.2 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/770,384 

CC FILING DATE: HEREWITH 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER; 60/009,486 

CC FILING DATE; December 15, 1995 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: 60/009,132 

CC FILING DATE; DECEMBER 22, 1995 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME; CERRONE, MICHAEL C. 

CC REGISTRATION NUMBER: 39,132 

CC REFERENCE/DOCKET NUMBER: PD-0096 US 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 855-0555 

CC TELEFAX; (415) 845-4166 

CC INFORMATION FOR SEQ ID NO: 3302: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 228 base pairs 

CC TYPE: nucleic acid 

CC STRANDEDNESS: single 

CC TOPOLOGY; linear 

CC MOLECULE TYPE: CDNA 

CC IMMEDIATE SOURCE : 

CC CLONE; 705186 

SQ SEQUENCE 228 BP; 48 A; 61 C; 63 G; 56 T; 0 OTHER. 

Query Match 14.6%; Score 228; DB 20; Length 228; 

Best Local Similarity 100.0%; Pred. No. 4.75e-199; 

Matches 228; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 50 AACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCA 109 

Db 61 AGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCA 120 

M I I I M I M M M IIMMMM M M II Ml I 
Qy 110 AGAGGAGAAATGGGGTGAACTTCTCCCTAGCTGTGGTGGTCATCTACCTGATCCTGCTCA 169 

Db 121 CCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCC 180 

II I M II M M IMMIIIIIIIIIM IM Ml MUM llll 
Qy 170 CCGCTGGCGCTGGGCTGCTGGTGGTCCAAGTTCTGAATCTGCAGGCGCGGCTCCGGGTCC 229 

Db 181 TGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGT 228 

M M I M II M MIMMMMMM M II 
Qy 230 TGGAGATGTATTTCCTCAATGACACTCTGGCGGCTGAGGACAGCCCGT 277 

Search completed: Fri Sep 11 07:38:26 1998 
Job time ; 2292 sees. 



Db 1 AACAAGCTGCTTTTCACCAAATTGCAATGGAGCCTTTCGAAATCAATGTTCCAAAGCCCA 60 
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Release 3.1A John F. Collins, Biocomputing Research unit. 
Copyright. (C) 1993-1996 University of Edinburgh, tf.K. 
Distribution rights by oxford Molecular Ltd 

MPsrch_pp protein - protein database search, using Smith-Waterman algorithm 

Run on: Fri Sep 11 06:56:53 1998; MasPar time 5,44 Seconds 

675,321 Million cell updates/sec 

Tabular output not generated. 



Title: 

Description: 
Perfect Score: 
Sequence: 

Scoring table: 



Searched: 



MJS-08-794-795-6 

(1-520) from OS08794795 .pep 

3665 

1 MRNKKILKEDELLSETOQAA NSWGHHDCSHEEDAGVECSV 520 

PAM 150 
Gap 11 

77021 seqs, 7058996 residues 



Post-processing: Minimum Match 0% 

Listing first 45 summaries 

Database: a-issued 

l:5_OQMB 2:PCT9_COMB 3:backfilesl 

Statistics: Mean 32.821; Variance 173,861; scale 0.189 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 
No. 



Score 



Query 

Match Length DB 



SUMMARIES 



ID 



Description 



Pred. No. 



1 


2414 


65.9 


518 


1 


US-08-392- 


2 


796 


21.7 


357 


1 


US-08-642- 


3 


796 


21.7 


357 


1 


(JS-07-609- 


4 


793 


21.6 


1077 


1 


US-08-642- 


5 


793 


21.6 


1077 


1 


US-07-972- 


6 


771 


21.0 


1064 


1 


OS-08-642- 


7 


757 


20.7 


408 


1 


US-07-609- 


8 


755 


20,6 


777 


1 


US-08-642- 


9 


739 


20.2 


561 


1 


OS-08-642- 


10 


734 


20.0 


1442 


2 


PCT-US95-0 


11 


731 


19.9 


330 


1 


US-08-642* 


12 


722 


19.7 


417 


1 


US-08-175- 


13 


722 


19.7 


417 


1 


US-08-477- 


14 


722 


19.7 


417 


1 


US-08-642- 


15 


722 


19.7 


837 


1 


US-08-642- 


16 


722 


19.7 


837 


1 


US-08-175- 


17 


722 


19.7 


837 


1 


(JS-08-477- 


18 


722 


19.7 


897 


1 


US-Q8-397- 


19 


720 


19.6 


633 


1 


US-08-642* 


20 


720 


19.6 


829 


1 


US-08-397- 


21 


720 


19.6 


829 


1 


US-08-642- 


22 


703 


19.2 


1065 


1 


DS-08-642- 


23 


693 


18.9 


252 


1 


US-08-642* 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 



2, Applicatio 
33, Applicati 
66, Applicati 
95, Applicati 
82, Applicati 
62, Applicati 
65, Applicati 
53, Applicati 

52, Applicati 
12, Applicati 
32, Applicati 
69, Applicati 
104, Applicat 

102, Applicat 
101, Applicat 
68, Applicati 

103, Applicat 
50, Applicati 
73, Applicati 

53, Applicati 
132, Applicat 
72, Applicati 
61/ Applicati 



3.69e-182 

5.31e-52 

5.31e-52 

9.17e-52 

9,17e-52 

5.05e-50 

6.45e-49 

9.28e-49 

1.70e-47 

4.23e-47 

7.29e-47 

3,74e-46 

3,74e-46 

3.74e-46 

3.74e-46 

3.74e-46 

3.74e-46 

3.74e-46 

5.38e-46 

5,38e-46 

5.38e-46 

l.lBe-44 

7.23e-44 



24 


684 


18 7 


682 


1 


cs-o 


8-397- 


Sequence 36, Applicati 


25 


684 


18! 7 


682 


1 


fS-0 


8-642- 


Sequence 126, Applicat 


26 


674 


ii4 


546 


1 


us-o 


8-494- 


Sequence 10, Applicati 


27 


667 


If. 2 


684 


1 


us-o 


8-555- 


Sequence 12, Applicati 


28 


649 


11.7 


532 


1 


us-o 


8-494- 


Sequence 9, Applicatio 


29 


645 


I'. 6 


234 


1 


us-o 


8-642- 


Sequence 51, Applicati 


30 


629 


1'.2 


1065 


1 


us-o 


8-642- 


Sequence 80, Applicati 


31 


629 


17.2 


1065 


2 


PCT-US96-0 


Sequence 16, Applicati 


32 


609 


36.6 


762 


1 


US-O 


8-397- 


Sequence 26, Applicati 


33 


609 


36-6 


762 


1 


US-08-642- 


Sequence 114, Applicat 


34 


603 


36 .5 


762 


1 


US-08-642- 


Sequence 120, Applicat 


35 


603 


/.6.5 


762 


i 


us-o 


8-397- 


Sequence 31, Applicati 


36 


606 


:l6.s 


1694 


1 


US-0S-494- 


Sequence 2, Applicatio 


37 


598 


16.3 i 


540 


1 


us-o 


8-383- 


Sequence 5, Applicatio 


38 


598 


16.3| 


540 


1 


us-o 


8-383- 


Sequence 5, Applicatio 


39 


598 


l6.3 


540 


1 


US-O 


8-383- 


Sequence 5, Applicatio 


40 


597 


16.3 


543 


1 


us-o 


8-383- 


Sequence 16, Applicati 


41 


597" 


16.3 


54? 


1 


us-o 


8-383- 


Sequence 16, Applicati 


42 


597 


16.3 


£43 


1 


us-o 


8-383- 


Sequence 16, Applicati 


43 


595 


16.2 


270 


1 


us-o 


8-383- 


Sequence 15, Applicati 


44 


595 


16.2 


;:73 


1 


us-o 


8-383- 


Sequence 3, Applicatio 


45 


595 


16.2 


2E5 


1 


us-o 


8-383- 


Sequence 20, Applicati 



3.69e-43 
3.69e-43 
2.26e-42 
8.03e-42 
2.09e-40 
4.30e-40 
7.75e*39 
7.75e-39 
2.86e-37 
2.86e-37 
8.44e-37 
8.44e-37 
4.91e-37 
2.066-36 
2.08e-36 
2.08e-36 
2.49e-36 
2.49e-36 
2.49e-36 
3.57e-36 
3,57e-36 
3.57e-36 



ALIGNMENTS 



RESULT 1 



ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



US-08-392-367B-2 STANDARD; PRT; 518 AA. 



xxxxxx 



Sequence 2, Application US/08392367B 

Sequence 2, Application US/08392367B 

Patent No. 5691197 

GENERAL INFORMATION: 
APPLICANT: Tryggvason, Karl 
APPLICANT: Elomaa, Outi 
APPLICANT: Kangas, Maarit 
TITLE OF INVENTION 

Patent No. 5691197 

TITLE OF INVENTION 

TITLE OF INVENTION 

TITLE OF INVENTION 
' TITLE OF INVENTION 

NUMBER OF SEQUENCES: 



An Insolated DNA Sequence For a 

No. 5691197el Macrophage Receptor with 
a Collagenous Domain and the 
Polypeptide Chain Encoded by 
such a Sequence 
2 



CORRESPONDENCE ADDRESS: 

ADDRESSEE: Fay, Sharpe, Beall, Fagan, 

ADDRESSEE: Minnich & McKee 

STREET: 1100 Superior Avenue 

STREET: Suite' 700 

CITY: Cleveland 

STATE: Ohio 

COUNTRY: U.S.A. 

ZIP: 44114-2518 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette, 3.50 inch, 

MEDIUM TYPE; 720 Kb storable 

COMPUTER: IBM PS/2, Model 35 SX 

OPERATING SYSTEM: DOS 5.0 

SOFTWARE: Word Perfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/392, 367B 

FILING DATE; 

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: Minnich, Richard J. 

REGISTRATION NUMBER: 24,175 

REFERENCE/DOCKET NUMBER: TRV 2 009 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (216) 861-5582 
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CC TELEFAX: (216) 241-1666 

CC TELEX : (216) 980162 

CC INFORMATION FOR SEQ ID NO: 2: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 518 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: Single 

CC TOPOLOGY: Linear 

SO SEQUENCE 518 AA; 52730 MW; 1281000 CN; 

Query Match 65,9%; Score 2414; DB 1; Length 518; 

Best Local Similarity 67.3%; Pred. No. 3,69e-182; 

Matches 352; Conservative 73; Mismatches 89; Indels 9; Gaps 8; 

Db 1 MGSKELLKEEDFLGSTEDRADFDQAMFPVMETFEINDPVPKKRNGGTFCMAVMAIHLILL 60 

I :| :|H:::|: |:: ! I I : : II MM I IMMI I MM I MM 

Qy 1 MRNKKILKEDELLSETQQ-AAFHQ-I-A-MEPFEINVPKPKRRNGVNFSLAVWIYLILL 56 

Db 61 TAGTALLLIQVLNLQEQLQMLEMCCGNGSLAIEDKPFFSL-QWA-PKTHLVPRAQGLQAL 118 

ll|::||::|lllll :|::lll I :ll II I III) I I II I II I 
Qy 57 TAGAGLLWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVL 116 

Db 119 QAQLSWVHTSQEQLRQQFNNLTQNPELFQIKGERGSPG-P-KGAPGAPGIPGLPGPAAE 175 

II :| : |;|:| |: :|:|||| :|:||l|:|:|l III I II II MMII 
Qy 117 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE 176 

Db 176 KGEKGAAGRDGTPGVQGPQGPPGSKGEAGLQGLTGAPGKQGATGAPGPRGEKGSKGDIGL 235 

II III Nil: I lllllll MM 1 Illlllllll:|||:|||||||| II 

Qy 177 KG AKG AMGRDG ATGPSG PQG P PGVKG EAGLQG PQG APG KQG ATGT PG PQG EKGSKGDGGL 236 

Db 236 TGPKGEHGTKGDKGDLGLPGNKGDMGMKGDTGPMGSPGAQGGKGDAGKPGLPGLAGSPGV 295 

Mill IIIMIIMMIMMI II: Ihllllhlll 1:11 Mill II 

Qy 237 IGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGA 296 

Db 296 KGDQGKPGVQGVPGPQGAPGLSGAKGEPGRTGLPGPAGPPGIAGNPGIAGVKGSKGDTGI 355 

Mill I : 1 1 J 1 1 1 II I MIIMI! : M M || :|:|| :MMMIMM 
Qy 297 KGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGL 356 

Db 356 QGQKGTKGESGVPGLVGRKGDTGSPGLAGPKGEPGRVGQKGDPGMKGSSGQQGQKGEKGQ 415 

iii i 1 1 ■ 1 1 1 i i ii: Milium ii: inn imiihii iiiim 

Qy 357 QGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGE 416 

Db 416 KGESFQRVRIMGGTNRGRAEVYYNNEWGTICDDDWDNNDATVFCRMLGYSRGRALSSYGG 475 

:||: MI:|::|IMIMII: 1 1 1 1 1 1 1 : 1 : 1 : 1 1 1 1 1 1 j 1 1 1 1 : 1 1 1 1 
Qy 417 RGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGA 476 

Db 476 GSGNIWLDNVNCRGTENSLWDCSKNSWGNHNCVHNEDAGVECS 518 

mi mm mmm mmmmmi mmiiimi 

Qy 477 GTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECS 519 



RESULT 2 

ID US-08-642-255-33 STANDARD; PRT; 357 AA, 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 33, Application US/08642255 
xx 

CC Sequence 33, Application US/08642255 

CC Patent No. 5773249 

CC GENERAL INFORMATION: 

CC APPLICANT: CAPPELLO, Joseph 

CC APPLICANT: FERRARI, Franco A. 

CC TITLE OF INVENTION: High Molecular Weight Collagen-Like 

CC TITLE OF INVENTION: Protein Polymers 

CC NUMBER OF SEQUENCES; 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 



CC COUNTRY: USA 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release tl.0, Version 11.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE: (415) 494-8700 

CC TELEFAX: (415) 494-8771 

CC TELEX; 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 33: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 357 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY; linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 357 AA; 31954 MW; 670322 CN; 

Query Match 21.7%; Score 796; DB 1; Length 357; 

Best Local Similarity 48.4%; Pred. No. 5.31e-52; 

Matches ;134; Conservative 45; Mismatches 95; Indels 3; Gaps 2; 

Db 40 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 99 

I 1:11 I I: I IMIIIII I: 11 :| I II |::|| I I I 

Cy 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 100 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 159 

II hll II I III I I: I I I :|::| II I I :|: I I I ; 

Qy 208 GPQ^PGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 160 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 219 

I Mil: I: I I Mill :|:ll:lll:|: I I :| llhhll Ml I I 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQ--P-GLQGVPGPPGAVGHPGAKGEP 324 

Db 220 GPPGP?GPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 279 

|::|:'U M IMIMIM I I I I :|: I II I Mill I MM 
Qy 325 GSAGSFGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLA 384 

Db 280 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAG 316 

II Mil M I I MM M I 1 1 : 1 1 : 

Qy 385 GPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENS 421 



RESULT 3 

ID US-07-609-716-66 STANDARD; PRT; 357 AA, 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 66, Application US/07609716 
XX 

CC Sequence 66, Application US/07609716 

CC Patent No. 5114581 

CC GENERAL INFORMATION: 

CC APPLICANT: Ferrari, Franco A. 

CC APPLICANT: Cappello, Joseph 

CC TITLE OF IIVENTION: Functional Recombinant^ Prepared 

CC TITLE OF IIVENTION: Synthetic Protein Polymer 

CC NUMBER OF SEQUENCES: 118 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

CC STREET : ; Tour Embarcadero Center, Suite 3400 
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CC CITY; San Francisco 

CC STATE: CA 

CC COUNTRY: OS 

CC ZIP: 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/07/609,716 

CC FILING DATE: 06-NOV-1990 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Rowland, Bertram I 

CC REGISTRATION NUMBER: 20015 

CC REFERENCE/DOCKET NUMBER: A-55186-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 415-781*1989 

CC TELEFAX: 415-398-3249 

CC INFORMATION FOR SEQ ID NO: 66: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 357 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY; linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 357 AA; 31954 MW; 670322 CN; 

Query Match 21.7%; Score 796; DB 1; Length 357; 

Best Local Similarity 48.4%; Pred. No. 5.31e-52; 

Matches 134; Conservative 45; Mismatches 95; Indels 3; Gaps 2; 

Db 40 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 99 

I hll I I: I Ihlllll h II :| I II |::|| III I :| 

Qy 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 100 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 159 

II hll II I III I I: I I I :|::| II I I :|: I I I : 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 160 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 219 

I MM: I: I I Mill :|:||:MI:|: I I llhhll III I I 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQ--P-GLQGVPGPPGAVGHPGAKGEP 324 

Db 220 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 279 

l::|:M :l 1 1 : 1 1 : 1 1 : I I 1 I :|: I II I Mill I hll : 
Qy 325 GSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLA 384 

Db 280 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAG 316 

II hll :| I I |::| |: I 

Qy 385 GPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENS 421 



RESULT 4 

ID US-08-642-255-95 STANDARD; PRT; 1077 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 95, Application US/08642255 
XX 

CC Sequence 95, Application US/08642255 

CC Patent No. 5773249 

CC GENERAL INFORMATION; 

CC APPLICANT: CAPPELLO, Joseph 

CC APPLICANT: FERRARI, Franco A. 

CC TITLE OF INVENTION: High Molecular Weight Collagen -Like 

CC TITLE OF INVENTION; Protein Polymers 

CC NUMBER OF SEQUENCES: 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 



CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

■CC COMPUTER; IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release *1.0, Version 11.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION; 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 494*8700 

CC TELEFAX: (415) 494-8771 

CC TELEX: 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 95: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1077 amino acids 

CC TYPE; amino acid 

CC STRANDEDNESS: Single 

CC TOPOLOGY; linear 

CC MOLECULE TYPE; protein 

SQ SEQUENCE 1077 AA; 91321 MW; 4955470 CN; 

Query Match 21.6%; Score 793; DB 1; Length 1077; 

Best Local Similarity 48.0%; Pred. No. 9.17e-52; 

Matches 143; Conservative 40; Mismatches 111; Indels 4; Gaps 2; 

Db 16 VTQLNRLAAHPPFASDPMGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 75 

I:: : I I llllllll II I Nil hi: h II 

Qy 125 VSHEHLLQRVDNFTQNP-GMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAM 183 

Db 76 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 135 

I :H I::hll:||: I :| MINI III hlhll I I I hi 
Qy 184 GRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGET 243 

Db 136 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 195 

I II I I: I :| I :l hll lh I I hill I III I II I 
Qy 244 GTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQP 303 

Db 196 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 255 

llll Ihlh I II I lh:H I :N Ihlll II II h I I II 
Qy 304 GLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGL- - -PGSPGSPGATGLKGSKGDTGLQGQQ 360 

Db 256 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQG 313 

I I :| lh I I l:M :h llll I I I h = l II I I :l 
Qy 361 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 5 

ID US-07-972-032-82 STANDARD; PRT; 1077 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 82, Application US/07972032 
XX 

CC Sequence 82, Application US/07972032 

CC Patent No. 5496712 

CC GENERAL INFORMATION: 

CC APPLICANT: Cappello, Joseph 

CC APPLICANT; Ferrari, Franco A. 

CC TITLE OF INVENTION; HIGH MOLECULAR WEIGHT COLLAGEN-LIKE 

CC TITLE OF INVENTION: PROTEIN POLYMERS 

CC NUMBER OF SEQUENCES: 85 
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CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Bertram I. Rowland 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: CA 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1,0, Version #1,25 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/07/972,032 

CC FILING DATE: 19921105 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US/07/791,960 

CC FILING DATE: 12-NOV-1991 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Rowland, Bertram I, 

CC REGISTRATION NUMBER; 20,015 

CC REFERENCE/DOCKET NUMBER: A*55556*1/BIR;PROP-08-1 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 781-1989 

CC TELEFAX: (415) 398-3249 

CC INFORMATION FOR SEQ ID NO: 82: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1077 amino acids 

CC TYPE: AMINO ACID 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 1077 AA; 91321 MW; 4955470 CN; 

Query Match 21.6%; Score 793; DB 1; Length 1077; 

Best Local Similarity 48.0%; Pred. No. 9.17e-52; 

Matches 143; Conservative 40; Mismatches 111; Indels 4; Gaps 2; 

Db 16 VTQLNRLAAHPPFASDPMGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 75 

I:: : I I: :| I I IIIMIII II I Mil hi: II 

Qy 125 VSHEHLLQRVDNFTQNP-GMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAM 183 

Db 76 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 135 

I :H :: -II: I: I :| Ml III |:||:|| I I I |: I 

Qy 184' GRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGET 243 

Db 136 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 195 

I I I I I: I :| I :| hi! Ih I I hill I III I II I 

Qy 244 GTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQP 303 

Db 196 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAP 255 

MM 1:11: I II I I:: I :|l Ihlll II II h I I II 

Qy 304 GLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGL- * -PGSPGSPGATGLKGSKGDTGLQGQQ 360 

Db 256 GLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQGAPGLQGAPGAPGSQG 313 

I I :! II: I I Ihll :h INI I I I I 

Qy 361 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 6 

ID US-08-642-255-62 STANDARD; PRT; 1064 AA, 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 62, Application US/08642255 
XX 

CC Sequence 62, Application US/08642255 

CC Patent No, 5773249 

CC GENERAL INFORMATION: 

CC APPLICANT : CAPPELLO, Joseph 



CC APPLICANT: FERRARI, Franco A. 

CC TITLE OF INVENTION: High Molecular Weight Collagen-Like 

CC TITLE OF INVENTION: Protein Polymers 

CC NUMBER OF SEQUENCES: 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC -DOS/MS -DOS 

CC SOFTWARE: Patentln Release 1*1.0, Version #1,30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 494*8700 

CC TELEFAX: (415) 494*8771 

CC TELEX: 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 62: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1064 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 1064 AA; 91858 MW; 5472318 CN; 

Query Match 21.0%; Score 771; DB 1; Length 1064; 

Best Local Similarity 49,6%; Pred. No. 5.05e-50; 

Matches 138; Conservative 43; Mismatches 88; Indels 9; Gaps 7; 

Db 51 KGAPGPAGPPGSRGDPGPPGAPGPAGPPGSRGDPGPPGAPGPAGPPGSRGDPGPPGAPGP 110 

II |::| I :| I 'Ml: :| |: I h:||:|::| II I 

Qy 147 KGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGL 206 

Db 111 AGPPGSRGDPGPPGAPGPAGPPGSRGDPGPPGAPGPAGPPGSRGDPGPPGAPGPAGPPGS 170 

II h I h hill I Ihll I I II I h:h lllhll 

Qy 207 QGPQGAPGKQGATGTPGPQGEKGSKGD-G-GLIGPKGETGTKGEKGDLGLPGSKGDRGM 263 

Db 171 RGDPGPPGAPGPAGPPGSRGDPGPPGAPGPAGPPGSRGDPGPPGAPGPAGPPGSRGDPGP 230 

:lhl I I I I: Ihll I Ihll II M | || | :|||h I Ih 

Qy 264 KGDAGVMG-P-P-GAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGA 320 

Db 231 PGAPGPAGPPGSRGDPGPPGAPGPAGPPGSRGDPGPPGAHGPAGPKGAHGPAGPKGAHG- 289 

I Ihlhll I lhll:lh:l Ihll I hi I I Nil II 

Qy 321 KGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGS 380 

Db 290 P--AGPKGAHGPAGPKGAPGPAGPPGSRGDPGPPGAPG 325 

I llllll I II II I h:l :l I I I 

Qy 381 PGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 7 

ID US-07-609-716-65 STANDARD; PRT; 408 AA, 
XX 

AC XXXXXX 

XX 

DT 

XX 

DE Sequence 65, Application US/07609716 
XX 

CC Sequence 65, Application US/07609716 
CC Patent No. 5514581 
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CC GENERAL INFORMATION: 

CC APPLICANT: Ferrari, Franco A, 

CC APPLICANT: Cappello, Joseph 

CC TITLE OF INVENTION: Functional Recombinantly Prepared 

CC TITLE OF INVENTION: Synthetic Protein Polymer 

CC NUMBER OF SEQUENCES: 118 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE; Flehr, Hohbach, Test, Albritton & Herbert 

CC STREET: Four Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: CA 

CC COUNTRY: US 

CC ZIP: 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC'DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version 11,30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/07/609,716 

CC FILING DATE: 06-NOV-1990 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Rowland, Bertram I 

CC REGISTRATION NUMBER: 20015 

CC REFERENCE/DOCKET NUMBER: A-55186 "3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 415-781-1989 

CC TELEFAX: 415-398-3249 

CC INFORMATION FOR SEQ ID NO: 65: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 408 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 408 AA; 35237 MW; 959886 CN; 

Query Match 20,7%; Score 757; DB 1; Length 408; 

Best Local Similarity 47.6%; Pred, No. 6.45e-49; 

Matches 129; Conservative 38; Mismatches 104; Indels 0; Gaps 0; 

Db 55 GPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAP 114 

I 1:11 I Ilimillh I: II I h Ihll Ihl I 

Qv 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 115 GPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPP 174 

II IMI I: hill I h I I II I I: I I I II: I I I : 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 175 GPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPP 234 

I INI: I: I I Mill :l lh I I II I Mill: I I: I It:: 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSA 327 

Db 235 GPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPP 294 

1:11 :| Ihlhll: I h I :| I M :| llhl I 

Qy 328 GSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPK 387 

Db 295 GPPGPAGPVGSPGAPGPPGPPGPPGPPGAPG 325 

1:11 II I I h:| I I I I 

Qy 388 GAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 8 

ID US-08-642-255-53 STANDARD; PRT; 777 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 53, Application US/08642255 

XX 

CC Sequence 53, Application US/08642255 



CC Patent No. 5773249 

CC GENERAL INFORMATION: 

CC APPLICANT: CAPPELLO, Joseph 

CC APPLICANT: FERRARI, Franco A, 

CC TITLE OF INVENTION: High Molecular Weight Collagen-Like 

CC TITLE OF INVENTION: Protein Polymers 

CC NUMBER OF SEQUENCES: 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1,0, Version 11.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER; US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 494-8700 

CC TELEFAX: (415) 494-8771 

CC TELEX; 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 53: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 777 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 777 AA; 64133 MW; 2828908 CN; 

Query Match 20.6%; Score 755; DB 1; Length 777; 

Best Local Similarity 49.1%; Pred. No. 9,28e-49; 

Matches 133; Conservative 39; Mismatches 96; Indels 3; Gaps 2 

Db 85 GPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAP 144 

I Nil I II I : Ml: II:: II |: I II Ihll hll I 

Qy 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 145 GPAGPPGAPGPAGPPGAHGPAGPKGAHGPAGPKGAHGPAGPKGAHGPAGPKGAPGPAGPP 204 

II hll h:| I: hll I Mil I I II I :hll I I : 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 205 GAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPA 264 

I Ihh h I 1 Ihll II III I I II I :lllll I ^ I Ihl 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSA 327 

Db 265 GPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAHGPAGPKGAHG-P--AGPK 321 

hll :| : 1 : 1 1 : 1 1 : : I h I :| II I :| INI II :l I INI 

Qy 328 GSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPK 387 

Db 322 GAHGPAGPKGAHGPAGPKGAPGPAGPPGAPG 352 

IN II II :l hi I I I I 

Qy 388 GAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 9 

ID US-08-642-255-52 STANDARD; PRT; 561 AA. 
XX 

AC xxxxxx 

xx 

DT 

XX 

DE Sequence 52, Application US/08642255 
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xx 

CC Sequence 52, Application US/08642255 

CC Patent No. 5773249 

CC GENERAL INFORMATION: 

CC APPLICANT: CAPPELLO, Joseph 

CC APPLICANT: FERRARI, Franco A, 

CC TITLE OF INVENTION: High Molecular Weight Collagen-Like 

CC TITLE OF INVENTION: Protein Polymers 

CC NUMBER OF SEQUENCES: 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC -DOS/MS -DOS 

CC SOFTWARE: Patentm Release #1,0, Version #1,30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 494-8700 

CC TELEFAX: (415) 494-8771 

CC TELEX: 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 52: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 561 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 561 AA; 46438 MW; 1494070 CN; 

Query Match 20.2%; Score 739; DB 1; Length 561; 

Best Local Similarity 48.7%; Pred. No. 1.70e-47; 

Matches 132; Conservative 39; Mismatches 97; Indels 3; Gaps 1; 

Db 193 GPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAP 252 

I Mil I II I 1 1 : 1 1 : 1 M : : II I: I II ll:H hll I 

Qv 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 253 G PAG P PG APG PAG P PG AHG PAG P KG AHG PAG PKG AHG PAG P KG AHG PAG PKG APG PAG PP 312 

II MM ll::| hll I INI Mill :hll I I 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 313 GAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPA 372 

I MM: |: I I ||:|| I ||| I :||||| | :|: | ||:| 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSA 327 

Db 373 GPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAGPPGAPGPAG- * -PPGAPGPAGPP 429 

Ml :| :|:||:||::| |: I :| I I I :| Mill |:|| III 

Qy 328 GSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPK 387 

Db 430 GAPGPAGPPGAPGPAGPPGAPGPAGPPGAPG 460 

Mil II I I MM I I I I 

Qy 388 GAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 10 

ID PCT-US95-02251-12 STANDARD; PRT; 1442 AA. 
XX 

AC xxxxxx 

XX 

DT 



XX 

DE Sequence 12, Application PC/TUS9502251 
XX 

CC Sequence 12, Application PC/TUS9502251 

CC GENERAL INFORMATION: 

CC APPLICANT: 

CC TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR STIMULATING BONE 

CC TITLE OF INVENTION: CELLS 

CC NUMBER OF SEQUENCES: 18 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Arnold, White & Durkee 

CC STREET: P.O. Box 4433 

CC CITY: Houston 

CC STATE: Texas 

CC COUNTRY: United States of America 

CC ZIP: 77210 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS/ASCII 

CC SOFTWARE: Patentln Release tl.0, Version 

CC SOFTWARE: #1,30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: PCT/US95/02251 

CC FILING DATE: CONCURRENTLY HEREWITH 

CC CLASSIFICATION: 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 08/316,650 

CC FILING DATE: 30-SEP-1994 

CC CLASSIFICATION: 

CC APPLICATION NUMBER: US 08/199,780 

CC FILING DATE: 1B-FEB-1994 

CC CLASSIFICATION: 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Parker, David L. 

CC REGISTRATION NUMBER: 32,165 

CC REFERENCE/DOCKET NUMBER: UMIC009P" 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (512) 418*3000 

CC TELEFAX: (713) 789-2679 

CC TELEX: 79-0924 

CC INFORMATION FOR SEQ ID NO: 12: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1442 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

SQ SEQUENCE 1442 AA; 137911 MW; 9845405 CN; 

Query Match 20.0%; Score 734; DB 2; Length 1442; 

Best Local Similarity 43.7%; Pred. No. 4.23e-47; 

Matches 128; Conservative 48; Mismatches 112; Indels 5; Gaps 3; 

Db 435 GAPGPAGEEGKRGARGEPGGAGPIGPPGERGAPGNRGFPGQDGLAGPKGAPGERGPSGLA 494 

I MM :| :|| | ||::|| |||:|:|| I I h |:|| :| Ml 

Qy 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 495 GPKGANGDPGRPGEPGLPGARGLTGRPGDAGPQGKVGPSGAPGEDGRPGPPGPQGARGQP 554 

II II I I I II I :| I I II I I I M I II: I :| M- 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 555 GVMGFPGPKGANGEPGKAGEKGLAGAPGLRGLPGKDGETGAAGPPGPSGPAGERGEQGAP 614 

llll II: l::h MM MM II :| I I I Mill: I M Ml 1 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSA 327 

Db 615 GPSGFQGLPGPPGPPGEGGKQGDQGIPGEAGAPGLVGPRGERGFPGERGSPGAQGLQGPR 674 

MM Ml:.:' M I I I I I I I M M I: II II: 

Qy 328 GSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPK 387 

Db 675 GLPGTPGTDGPKGAAGPDGPPGAQGPPGLQGMPGERGAAG-IAGPKG-DRGDV 725 

I Ml I II I \ II I I II = : I h MM 

Qy 388 G--APGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEV 437 
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RESULT 11 

ID (JS-08-642-255-32 STANDARD; PRT; 330 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 32, Application US/08642255 
XX 

CC Sequence 32, Application US/08642255 

CC Patent No. 5773249 

CC GENERAL INFORMATION: 

CC APPLICANT: CAPPELLO, Joseph 

CC APPLICANT: FERRARI, Franco A. 

CC TITLE OF INVENTION: High Molecular Weight Collagen -Like 

CC TITLE OF INVENTION: Protein Polymers 

CC NUMBER OF SEQUENCES : 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC -DOS/MS -DOS 

CC SOFTWARE: Patentln Release 11,0, Version #1,30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 494-8700 

CC TELEFAX: (415) 494-8771 

CC TELEX: 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 32: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 330 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 330 AA; 28876 MW; 625100 CN; 

Query Match 19.9%; Score 731; DB 1; Length 330; 

Best Local Similarity 48,2%; Pred, No. 7,29e-47; 

Matches 123; Conservative 37; Mismatches 95; Indels 0; Gaps 0; 

Db 55 GPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAP 114 

I I : i I I IIIIEIIII!: h II I |: INI IMI I :l 

Qy 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 115 GPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPP 174 

II l:M I: |:IH I I : I I 1 1 I I : I I I 1 1 : I I I : 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 175 GPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPP 234 

I Mil: I: I I Mill II: I I II I llllh i h I II:: 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSA 327 

Db 235 GPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPP 294 

|:|| :l (1:11:11: I |: I :| I I I :| IIMI I Ml :|l 

Qy 328 GSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPK 387 

Db 295 GPPGPAGPVGSPGAM 309 



I'll ii i i ' 
Qy 388 GAPGQAGQKGDQGVK 402 



RESULT 12 

ID US-08-175-155-69 STANDARD; PRT; 417 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 69, Application US/08175155 
XX 

CC Sequence 69, Application US/08175155 

CC Patent No. 5641648 

CC GENERAL INFORMATION: 

CC APPLICANT: Ferrari, Franco A. 

CC APPLICANT: Cappello, Joseph 

CC APPLICANT: Crissman, John W. 

CC APPLICANT: Dorman, Mary A. 

CC TITLE OF INVENTION: Methods for Preparing Synthetic 

CC TITLE OF INVENTION: Repetitive DNA 

CC NUMBER OF SEQUENCES: 69 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE; Flehr, Hohbach, Test, Albritton s Herbert 

CC STREET: Four Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: CA 

CC COUNTRY: US 

CC ZIP; 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/175,155 

CC FILING DATE: 29-DEC-1993 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Rowland, Bertram I. 

CC REGISTRATION NUMBER: 20015 

CC REFERENCE/DOCKET NUMBER; A-55186 -5/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 415-781*1989 

CC TELEFAX: 415-398-3249 

CC INFORMATION FOR SEQ ID NO: 69: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 417 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: Single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

SQ SEQUENCE 417 AA; 37083 MW; 955434 CN; 

Query Match 19,7%; Score 722; DB 1; Length 417; 

Best Local Similarity 46.1%; Pred. No. 3,74e-46; 

Matches 136; Conservative 32; Mismatches 123; Indels 4; Gaps 2; 

Db 16 VTQLNRLAAHPPFASDPMGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 75 

|:: : I |: :| I I II II I hi I IIMMI: h I 

Qy 125 VSHEHLLQRVDNFTQNP-GMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAM 183 

Db 76 GSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 135 

I II I :|||l II I :| MM II II lllllll II I I II I 

Qy 184 GRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGET 243 

Db 136 GSPGAPGT- - -PGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 192 

|: I I ||: I I I :| II I II I I III IN Ml I I 

Qy 244 GTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQP 303 

Db 193 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 252 

II I IIMI: 1 I I ll::|:M llllllhll: I I I I I I 
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Qy 304 GLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRK 363 

Db 253 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPG 307 

I :l l|::| ! II II :|: Ml I I I |::| lh I I I 
Qy 364 GESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 13 

ID US-08-477-509B-104 STANDARD; PRT; 417 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 104, Application (JS/08477509B 
XX 

CC Sequence 104, Application US/08477509B 

CC Patent No. 5770697 

CC GENERAL INFORMATION: 

CC . APPLICANT: Ferrari, Franco A 

cc applicant: cappello, Joseph 

CC APPLICANT : Crissman, John w 

CC APPLICANT: Dorman, Mary A 

CC TITLE OF INVENTION: No. 5770697el Peptides Comprising Repetitive 

CC TITLE OF INVENTION: Units of Amino Acids and DNA Sequences Encoding the Same 

CC NUMBER OF SEQUENCES: 112 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

CC STREET: Four Embarcadero Center, Suite 3400 

CC CITY; San Francisco 

CC STATE: California 

CC COUNTRY: US 

CC ZIP: 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version 11.30 

CC CURRENT APPLICATION DATA; 

CC APPLICATION NUMBER: US/08/477, 509B 

CC FILING DATE: 07-JUN-1995 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER; US 08/175,155 

CC FILING DATE: 29-DEM993 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 08/053,049 

CC FILING DATE: 22 -APR- 1993 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 07/114,618 

CC FILING DATE: 29 -OCT -1987 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 06/927,258 

CC FILING DATE: G4-NOV-1986 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Trecartin, Richard F. 

CC REGISTRATION NUMBER: 31,801 

CC REFERENCE/DOCKET NUMBER: A-55186-7/RFT/MTK 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 415-781-1989 

CC TELEFAX: 415-398-3249 

CC INFORMATION FOR SEQ ID NO: 104: 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 417 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

SQ SEQUENCE 417 AA; 37083 MW; 955434 CN; 

Query Match 19.7%; Score 722; DB 1; Length 417; 

Best Local Similarity 46.1%; Pred. No. 3.74e-46; 

Matches 136; Conservative 32; Mismatches 123; Indels 4; Gaps 2; 



Db 16 VTQLNRLAAHPPFASDPMGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 75 

I:: : I |: :| I I II II I |: I I 1 1 : 1 1 : 1 : h I 
Qy 125 VSHEHLLQRVDNFTQNP-GMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAM 183 

Db 76 GSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 135 

I II I :IHI II I :| Nil II II I ' III I M I 

Qy 184 GRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGET 243 

Db 136 GSPGAPGT- - -PGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 192 

hll II: I I I :| II I II I I 111 lh! Ml I I 
Qy 244 GTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQP 303 

Db 193 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 252 

II I Ihll: I I I ll::|:M llllllhlh I I I I I I 

Qy 304 GLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRK 363 

Db 253 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPG 307 

I :| :: | || || :|: |:|| | | I |::| ||: I I I 
Qy 364 GESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 14 

ID US-08-642-255-102 STANDARD; PRT; 417 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 102, Application US/08642255 
XX 

CC Sequence 102, Application US/08642255 

CC Patent No. 5773249 

CC GENERAL INFORMATION: 

CC APPLICANT: CAPPELLO, Joseph 

CC APPLICANT: FERRARI, Franco A. 

CC title OF INVENTION: High Molecular Weight Collagen -Like 

CC TITLE OF INVENTION: Protein Polymers 

CC NUMBER OF SEQUENCES: 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

CC STREET; 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY; USA 

CC ZIP: 94111-4187 ' 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC -DOS/MS -DOS 

CC SOFTWARE: Patentln Release II. 0, Version 11.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER; US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram L 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 494-8700 

CC TELEFAX: (415) 494-8771 

CC TELEX; 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 102: 

CC SEQUENCE CHARACTERISTICS: 

CC XENGTH: 417 amino acids 

CC TYPE; amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY; linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 417 AA; 37083 MW; 955434 CN; 

Query Match 19.7%; Score 722; DB 1; Length 417; 



Fri Sep 11 07:51:44 1998 
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Best Local Similarity 46. U; Pred, No, 3.74e-46; 

Matches 136; Conservative 32; Mismatches 123; Indels 4; Gaps 2; 

Db 16 VTQLNRLAAHPPFASDPMGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 75 

I:: : I |: :| I I II II I |: I I f I : I E : I : I 
Qy 125 VSHEHLLQRVDNFTQNP-GMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAM 183 

Db 76 GSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 135 

I II I :|lll II I :| Mil II II lllllll II II It I 

Qy 184 GRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGET 243 

Db 136 GSPGAPGT---PGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 192 

I: I I . II: I I I :| II I II I I III Ihl Ml I I 
Qy 244 GTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQP 303 

Db 193 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 252 

II I Ihll: I I I 1 1 : : 1 : 1 1 MIIIHMM I II I I I 

Qy 304 GLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLRGSKGDTGLQGQQGRK 363 

Db 253 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPG 307 

I M I II II 1:11 I I I i::l II: I I I 

Qy 364 GESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 15 

ID US-08-642-255-101 STANDARD; PRT; 837 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 101, Application US/08642255 

XX 

CC Sequence 101, Application OS/08642255 

CC Patent No. 5773249 

CC GENERAL INFORMATION: 

CC APPLICANT: CAPPELLO, Joseph 

CC APPLICANT: FERRARI, Franco A. 

CC TITLE OF INVENTION: High Molecular Weight Collagen-Like 

CC TITLE OF INVENTION: Protein Polymers 

CC NUMBER OF SEQUENCES: 135 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: FLEHR, HOHBACH, TEST, ALBRITTON & HERBERT 

CC STREET: 4 Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: California 

CC COUNTRY: USA 

CC ZIP: 94111-4187 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/642,255 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: ROWLAND, Bertram I. 

CC REGISTRATION NUMBER: 20,015 

CC REFERENCE/DOCKET NUMBER: A55556-3/BIR 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (415) 494-8700 

CC TELEFAX: (415) 494-8771 

CC TELEX: 910 277299 FHT UR 

CC INFORMATION FOR SEQ ID NO: 101: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 837 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 837 AA; 72682 MW; 3863626 CN; 



Query Match 19.7%; Score 722; DB 1; Length 837; 

Best Local Similarity 46.11; Pred. No. 3.74e-46; 

Matches 136; Conservative 32; Mismatches 123; Indels 4; Gaps 2 

Db 16 VTQLNRLAAHPPFASDPMGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 75 

h: : I |: :| I I II II I Ml I IIMhh h I 
Qy 125 VSHEHLLQRVDNFTQNP-GMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAM 183 

Db 76 GSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLP 135 

I II I Mill II I MM M II lllllll lllllll 

Qy 184 GRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGET 243 

Db 136 GSPGAPGT - - - PGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 192 

|: I I II: I I I M Ml II I I III MM III I I 
Qy 244 GTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQP 303 

Db 193 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQ 252 

II I Ml: I I I M::|:li MillUMM I I I I I I 

Qy 304 GLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRK 363 

Db 253 GLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPGTPGPQGLPGSPGAPG 307 

I M IhM I II II M: MM I I I MM II: II I 
Qy 364 GESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 

Search completed: Fri Sep 11 06:57:06 1998 
Job time : 13 sees. 
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Release 3,1A John F, Collins, Biocomputing Research Unit, 
Copyright (c) 1993*1998 University of Edinburgh, O.K. 
Distribution rights by Oxford Molecular Ltd 

MPsrch_pp protein - protein database search, using Smith -Waterman algorithm 

Run on: Fri Sep 11 06:57:25 1998; MasPar time 22.56 Seconds 

724.438 Million cell updates/sec 

Tabular output not generated. 



Title: 

Description: 
Perfect Score: 
Sequence: 

Scoring table: 



Searched: 



>US-08-794-795-6 

(1-520) from US08794795 . pep 

3665 

1 MRNKK ILKEDELLSETQQAA NSWGHHDCSHEEDAGVECSV 520 

PAM 150 
Gap 11 

286381 seqs, 31430319 residues 



Post-processing: Minimum Match 0% 

Listing first 45 summaries 



Database: 



a-pending 
1:P9 2 :U60 3:U7 4: 
10:U86 11:U87 12:U 
17:NEWU8 18:NEWU9 



0 5:U81 6:U82 7 :U83 8:U84 9:U85 
13 :U89 14 :U90 15:091 16:NEWP 



Statistics: Mean 34.734; Variance 179.067; scale 0.194 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution, 



SUMMARIES 



Result 



Query 



NO. 


Score 


Match Length DB 


ID 




Description 


Pred. No, 


1 


3665 


100.0 


520 11 


US-0 


8-794- 


Sequence 


6, Applicatio 


2,68e 


290 


2 


3653 


99.7 


520 13 


us-o 


8*934* 


Sequence 


1, Applicatio 


2.70e 


289 


3 


3493 


95.3 


495 11 


US-0 


8-794- 


Sequence 


2, Applicatio 


6,29e 


276 


4 


2414 


65.9 


518 12 


us-o 


8-893- 


Sequence 


2, Applicatio 


5.98e 


186 


5 


2414 


65.9 


518 13 


US-0 


8-934- 


Sequence 


3, Applicatio 


5,98e 


186 


6 


2369 


64.6 


489 11 


us-o 


8-794- 


Sequence 


7, Applicatio 


3.32e 


182 


7 


796 


21.7 


357 8 


US-0 


8-478- 


Sequence 


66, Applicati 


2,12e 


52 


8 


796 


21.7 


357 8 


us-o 


8-475- 


Sequence 


66, Applicati 


2.12e 


52 


9 


757 


20.7 


408 8 


US-08-475- 


Sequence 


65, Applicati 


3,07e 


49 


10 


757 


20.7 


408 8 


us-o 


8-478- 


Sequence 


65, Applicati 


3,07e 


49 


11 


757 


20,7 


1107 11 


us-o 


8-763- 


Sequence 


8, Applicatio 


3.07e 


49 


12 


757 


20,7 


1169 11 


US-08-763- 


Sequence 


5, Applicatio 


3.07e 


49 


13 


757 


20.7 


1171 11 


us-o 


8-763- 


Sequence 


6, Applicatio 


3.07e 


49 


14 


757 


20.7 


1341 5 


us-o 


8-187- 


Sequence 


18, Applicati 


3.07e 


49 


15 


757 


20.7 


1341 13 


us-o 


8-963- 


Sequence 


18, Applicati 


3.07e 


49 


16 


757 


20.7 


1388 11 


us-o 


8-763- 


Sequence 


7, Applicatio 


3.07e 


49 


17 


749 


20.4 


1078 5 


us-o 


8*187- 


Sequence 


21, Applicati 


1.36e 


48 


18 


749 


20.4 


1078 13 


us-o 


8-963* 


Sequence 


21, Applicati 


1.36e 


48 


19 


742 


20.2 


655 3 


US-07-616- 


Sequence 


1, Applicatio 


5.03e 


48 


20 


734 


20.0 


1442 7 


US-0 


8-316- 


Sequence 


12, Applicati 


2.23e 


47 


21 


729 


19.9 


492 8 


us-o 


8-472- 


Sequence 


12, Applicati 


5.66e 


47 



22 


729 


19.9 


492 8 


US-08- 


461- 


Sequence 12, Applicati 


5.66e-47 


23 


729 


19.9 


492 8 


US-Q8- 


469- 


Sequence 12, Applicati 


5.66e-47 


24 


729 


19.9 


492 8 


US-08- 


461- 


Sequence 12, Applicati 


5.66e-47 


25 


728 


19.9 


492 8 


US-08- 


469- 


Sequence 11, Applicati 


6.82e-47 


26 


728 


19,9 


492 8 


US-08- 


461- 


Sequence 11, Applicati 


6.82e-47 


27 


728 


19.9 


492 8 


US-08- 


472- 


Sequence 11, Applicati 


6.82e-47 


28 


728 


19.9 


492 8 


US-08- 


461- 


Sequence 11, Applicati 


6.82e-47 


29 


730 


19.9 


1418 8 


US-08- 


448- 


Sequence 1, Applicatio 


4.70e-47 


30 


730 


19.9 


1418 13 


US-08- 


963- 


Sequence 20, Applicati 


4.70e-47 


31 


730 


19.9 


1418 3 


US-07- 


984- 


Sequence 1, Applicatio 


4.70e-47 


32 


730 


19,9 


1418 5 


US-08- 


187- 


Sequence 20, Applicati 


4.70e-47 


33 


727 


19.8 


1017 8 


US-08- 


461- 


Sequence 10, Applicati 


8.22e-47 


34 


727 


19.8 


1017 8 


US-08- 


461- 


Sequence 10, Applicati 


8.22e-47 


35 


727 


19,8 


1017 8 


US-08- 


472* 


Sequence 10, Applicati 


8.22e-47 


36 


727 


19,8 


1017 8 


US-08- 


469* 


Sequence 10, Applicati 


8.22e-47 


37 


722 


19,7 


417 11 


US-08- 


707 - 


Sequence 76, Applicati 


2.08e-46 


38 


722 


19.7 


417 8 


US-08- 


477- 


Sequence 104, Applicat 


2.08e-46 


39 


722 


19.7 


417 8 


US-G8- 


482- 


Sequence 104, Applicat 


2.08e-46 


40 


722 


19,7 


417 11 


US-08- 


707- 


Sequence 76, Applicati 


2.08e-46 


41 


722 


19,7 


837 11 


US-08- 


707* 


Sequence 75, Applicati 


2.08e-46 


42 


722 


19,7 


837 11 


US-08- 


707- 


Sequence 75, Applicati 


2.08e-46 


43 


722 


19.7 


837 8 


US-08- 


477- 


Sequence 103, Applicat 


2.08e-46 


44 


722 


19,7 


837 8 


US-08- 


482- 


Sequence 103, Applicat 


2.08e-46 


45 


696 


19.0 


1366 5 


US-08- 


187- 


Sequence 19, Applicati 


2.62e-44 



ALIGNMENTS 



RESULT 1 



ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
CC 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



US-08-794-795-6 



xxxxxx 



STANDARD; 



PRT; 520 AA. 



Sequence 6, Application US/08794795 

Sequence 6, Application US/08794795 
GENERAL INFORMATION; 

APPLICANT: Elshourlagy, Nabil 

APPLICANT: Adamou, John 

APPLICANT: Gross, Mitchell 

APPLICANT: Lysko, Paul 

TITLE OF INVENTION: Human Macro Scavenger Rec 
TITLE OF INVENTION: eptor 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SmithKline Beecham Corporation 

STREET: 709 Swedeland Road 

CITY: King of Prussia 

STATE: PA 

COUNTRY: USA 

ZIP: 19406 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Diskette 

COMPUTER: IBM Compatible 

OPERATING SYSTEM: DOS 

SOFTWARE: FastSEQ for Windows Version 2.0 
CURRENT APPLICATION DATA; 

APPLICATION NUMBER: US/08/794,795 

FILING DATE: 04-FEB-1997 

CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: ATG50009P 

FILING DATE: 22-MAY-1996 
ATTORNEY/AGENT INFORMATION: 

NAME: Han, William T 

REGISTRATION NUMBER: 34,344 

REFERENCE/DOCKET NUMBER: ATG50009 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 610-270-5219 

TELEFAX: 610-270-4026 

TELEX: 
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CC INFORMATION FOR SEQ ID NO: 6: 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 520 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: peptide 

SQ SEQUENCE 520 AA; 52658 MW; 1340662 CN; 

Query Match 100,0%; Score 3665; DB 11; Length 520; 

Best Local Similarity 100,0%; Pred. No, 2.68e-290; 

Matches 520; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Db 1 MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVWIYLILLTAGA 60 

IIIMIMMIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIII 
Qy 1 MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVWIYLILLTAGA 60 

Db 61 GLLWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL 120 

lllllllillltlllilllllllllllilllllMIIIIIIIIIIIIIMNIIIIMII 
Qy 61 GLLWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL 120 

Db i 121 TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK 180 

IIIIIMIIIIEIMIIIIIMIIIMIIIIIIIIMIIMIIMIIIIIMIIIIIMI 
Qy 121 TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK 180 

Db 181 GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK 240 

MIIMI III Ml IMIIIIM II !|: ,MI|!M II I: II 
Qy 181 GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK 240 

Db 241 GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQ 300 

IIIIIIMIIIIIIIIMIIIIIIIIIIMIIIIIIIIIIllllilllllllllllMII 
Qy 241 GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQ 300 

Db 301 GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ 360 

IIMIMIIIMIIIIIMIIMIIMMIMIIIIIMMIIMIIIIIMMIIIMI 
Qy 301 GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ 360 

Db . 361 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN 420 

IIIIIIIIIIIIIIIMIIIIIIIIIMilllllllllllllllllllllllllllllll 
Qy 361 GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN 420 

Db 421 SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ 480 

MllllllllllllllllllllillllllllllllilllilillllllllllllllllM 
Qy 421 SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ 480 

Db 481 IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV 520 

: 1 1 ' 1 1 1 1 ; 1 1 ■ 1 1 ' 

Qy 481 IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV 520 



RESULT 2 

ID US-08-934 -168-1 STANDARD; PRT; 520 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 1, Application US/08934168 
XX 

CC Sequence 1, Application US/08934168 

CC GENERAL INFORMATION: 

CC APPLICANT: Bandman, Olga 

CC APPLICANT: Lai, Preeti 

CC APPLICANT: Corley, Neil C. 

CC APPLICANT: Shah, Purvi 

CC TITLE OF INVENTION: HUMAN MARCO 

CC NUMBER OF SEQUENCES: 3 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Incyte Pharmaceuticals, Inc, 

CC STREET: 3174 Porter Drive 

CC CITY: Palo Alto 

CC STATE: CA 

CC COUNTRY: USA 



CC 


ZIP: 94304 


CC 


COMPUTER READABLE FORM: 


CC 


MEDIUM TYPE: Diskette 


CC 


COMPUTER: IBM Compatible 


CC 


OPERATING SYSTEM: DOS 


CC 


SOFTWARE: FastSEQ for Windows Version 2,0 


CC 


CURRENT APPLICATION DATA: 


CC 


APPLICATION NUMBER: US/08/934,168 


CC 


FILING DATE: Herewith 


CC 


CLASSIFICATION: 435 


CC 


PRIOR APPLICATION DATA: 


CC 


APPLICATION NUMBER: 


CC 


FILING DATE: 


CC 


ATTORNEY/AGENT INFORMATION: 


CC 


NAME: Billings, Lucy J. 


CC 


REGISTRATION NUMBER: 36,749 


CC 


REFERENCE/DOCKET NUMBER: PF-0392 US 


CC 


TELECOMMUNICATION INFORMATION: 


CC 


TELEPHONE: 650-855-0555 


CC 


TELEFAX: 650-845-4166 


CC 


TELEX: 


CC 


INFORMATION FOR SEQ ID NO: 1: 


CC 


SEQUENCE CHARACTERISTICS: 


CC 


LENGTH: 520 amino acids 


CC 


TYPE: amino acid 


CC 


STRANDEDNESS: single 


CC 


TOPOLOGY: linear 


CC 


IMMEDIATE SOURCE: 


CC 


LIBRARY: MMLR3DT01 


CC 


CLONE: 569648 


SQ 


SEQUENCE 520 AA; 52674 MW; 1339522 CN; 



Query Match 99,7%; Score 3653; DB 13; Length 520; 

Best Local Similarity 99.8%; Pred. No. 2.70e-289; 



Matches 


519; Conservative 0; Mismatches 1; Indels 0; Gaps 


Db 


l 


MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAWVIYLILLTAGA 


60 






MINIMI M IMIIIIM II ,11 :|1IHMI II II. -111 




Qy 


l 


MRNKKILKEDELLSETQQAAFHQIAMEPFEINVPKPKRRNGVNFSLAVWIYLILLTAGA 


60 


Db 


61 


GLLWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL 


120 






MM MM 1 IIIMM MM MMMM. Ml II MM 




Qy 


61 


GLLWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQL 


120 


Db 


121 


TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK 


180 






MMI M M M IIIMM I 1 M' MMMMMM II MM 




Qy 


121 


TWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAK 


180 


Db 


181 


GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK 


240 






MM MM M MIIMI M M MMMM M M MM 




Qy 


181 


GAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPK 


240 


Db 


241 


GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRLGPPGLAGFPGAKGDQ 


300 






MiM 1 M M MiilM M II Mill 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 




Qy 


241 


GETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQ 


300 


Db 


301 


GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ 


360 






MINI 1 MM MIMM N M TMMM M Ml MM 1 




Qy 


301 


GQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQ 


360 


Db 


361 


GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN 


420 






IIMM M NIMMIMMM II MMMN MMI MIIMI 




Qy 


361 


GRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGEN 


420 


Db 


421 


SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ 


480 






MMMM M M IIIMM M II MMMM M M MMM 




Qy 


421 


SVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQ 


480 


Db 


481 


IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV 520 








II IN M 1 M MMM M M MM 




Qy 


481 


IWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECSV 520 
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RESULT 3 

ID US-08-79H95-2 STANDARD; PRT; 495 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 2, Application US/08794795 
xx 

CC Sequence 2, Application US/08794795 

CC GENERAL INFORMATION; 

CC APPLICANT: Elshourlagy, Nabil 

CC APPLICANT; Adamou, John 

CC APPLICANT; Gross, Mitchell 

CC APPLICANT; Lysko, Paul 

CC title OF INVENTION: Human Macro Scavenger Rec 

CC TITLE OF INVENTION: eptor 

CC NUMBER OF SEQUENCES: 9 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: SmithKline Beecham Corporation 

CC STREET: 709 Svedeland Road 

CC CITY: King of Prussia 

CC STATE: PA 

CC COUNTRY: USA 

CC ZIP: 19406 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Diskette 

CC COMPUTER: IBM Compatible 

CC OPERATING SYSTEM: DOS 

CC SOFTWARE: FastSEQ for windows Version 2,0 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/794,795 

CC FILING DATE: G4-FEB-1997 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: ATG50009P 

CC FILING DATE: 22-MAY-1996 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Han, William T 

CC REGISTRATION NUMBER: 34,344 

CC REFERENCE/DOCKET NUMBER; ATG50009 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 610-270-5219 

CC TELEFAX: 610-270-4026 

CC TELEX: 

CC INFORMATION FOR SEQ ID NO: 2; 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH; 495 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE; peptide 

SQ SEQUENCE 495 AA; 49764 MW; 1218428 CN; 



Query Match 95.3%; Score 3493; DB 11; Length 495; 

Best Local Similarity 99.8%; Pred. No. 6.29e-276; 

Matches 494; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Db 


l 


MEPFEINVPKPKRRNGVNFSLAVWIYLILLTAGAGLLWQVLNLQARLRVLEMYFLNDT 


60 






IIIIIIMIIIIIIIIIIIIIMIIllllllllllllllllllllllllllllilllll! 




Qy 


26 


ME PFE I NVPKPK RRNGVNFS LA VWI YL I LLT AG AG LL WQVLNLQ ARLRVLEMYFLNDT 


85 


Db 


61 


LAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFR 


120 






IMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMtllllllll 




Qy 


86 


LAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFR 


145 


Db 


121 


IKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGTTGPSGPQGPPGVKGEAG 


180 






M MUM MIMM III MUM Ml Mill Mi MlM M Ml 




Qy 


146 


I KG EQG APG LQGHKG AMGMPG APG P PG P PAEKG AKG AMG RDG ATG PSG PQG P PG VKG E AG 


205 


Db 


181 


LQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKG 


240 






IM MUM MMM II! MMI IM MIM II MM II Ml 




Qy 


206 


LQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKG 


265 



Db 241 DAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPG 300 

M 1 M M'M 'M MIM Ml I I MMMI iMIM M 
Qy 266 DAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPG 325 

Db 301 SAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAG 360 

MMi M MM M MIM M MM I MM' M' IMM I 
Qy 326 SAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAG 385 

Db 361 PKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGT 420 

I MM M Ml I Ml M MM ! MM M' MM: M 
Qy 386 PKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGT 445 

Db 421 ICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGH 480 

I MM M M'' M: MM M M I '! IM M MM M' 
Qy 446 ICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGH 505 

Db 481 HDCSHEEDAGVECSV 495 

M MM Ml I 
Qy 506 HDCSHEEDAGVECSV 520 



RESULT 4 

ID US-08-893-467A-2 STANDARD; PRT; 518 AA. 
XX 

AC XXXXXX 

XX 

DT 

XX 

DE Sequence 2, Application US/08893467A 

XX 

CC Sequence 2, Application US/08893467A 

CC GENERAL INFORMATION; 

CC APPLICANT; Tryggvason, Karl 

CC APPLICANT: Elomaa, Outi 

CC APPLICANT; Kangas, Maarit 

CC TITLE OF INVENTION; An Insolated DNA Sequence For a 

CC TITLE OF INVENTION; Novel Macrophage Receptor with 

CC TITLE OF INVENTION: a Collagenous Domain and the 

CC TITLE OF INVENTION: Polypeptide Chain Encoded by 

CC TITLE OF INVENTION: such a Sequence 

CC NUMBER OF SEQUENCES; 2 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Fay, Sharpe, Beall, Fagan, 

CC ADDRESSEE: Minnich & McKee 

CC STREET: 1100 Superior Avenue 

CC STREET: Suite 700 

CC CITY: Cleveland 

CC STATE: Ohio 

CC COUNTRY; U.S.A. 

CC ZIP: 44114-2518 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Diskette, 3.50 inch, 

CC MEDIUM TYPE: 720 Kb storable 

CC COMPUTER; IBM PS/2, Model 35 SX 

CC OPERATING SYSTEM: DOS 5.0 

CC SOFTWARE: Word Perfect 5.1 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER; US/08/893, 467A 

CC FILING DATE: 

CC CLASSIFICATION: 435 

CC ATTORNEY/AGENT INFORMATION; 

CC NAME: Minnich, Richard J, 

CC REGISTRATION NUMBER; 24,175 

CC REFERENCE/DOCKET NUMBER: TRV 2 009 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: (216) 861-5582 

CC TELEFAX: (216) 241-1666 

CC TELEX; (216) 980162 

CC INFORMATION FOR SEQ ID NO: 2; 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 518 amino acids 

CC TYPE; amino acid 
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CC STRANDEDNESS: Single 

CC TOPOLOGY: Linear 

SQ SEQUENCE 518 AA; 52730 MW; 1281000 CN; 

Query Match 65.91; Score 2414; DB 12; Length 518; 

Best Local Similarity 67.3%; Pred. No. 5.98e-186; 

Matches 352; Conservative 73; Mismatches 89; Indels 9; Gaps 8; 

Db 1 MGSKELLKEEDFLGSTEDRADFDQAMFPVMETFEINDPVPKKRNGGTFCMAVMAIHLILL 60 

I :l :ill:::h I:: I I I : : II III! I MM I :||: I III! 
Qy 1 MRNKKILKEDELLSETQQ-AAFHQ-I-A-MEPFEINVPKPKRRNGVNFSLAVWIYLILL 56 

Db 61 TAGTALLLIOVLNLQEQLQMLEMCCGNGSLAIEDKPFFSL-OWA-PKTHLVPRAQGLQAL 118 

ll|::||::|IMII :|::ill I :|l II I III! II II I II I 
Qy 57 TAGAGLLWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVL 116 

Db 119 QAQLSWVHTSQEQLRQQFNNLTQNPELFQIKGERGSPG-P-KGAPGAPGIPGLPGPAAE 175 

lllhll: |:|:| |: :|:|||| :|:||||:|:|l III I II II M 
Qy 117 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE 176 

Db 176 KGEKGAAGRDGTPGVQGPQGPPGSKGEAGLQGLTGAPGKQGATGAPGPRGEKGSKGDIGL 235 

ii 111 mi: i until ii ii iiiniini:iii:iiniin n 

Qy 177 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGL 236 

Db 236 TGPKGEHGTKGDKGDLGLPGNKGDMGMKGDTGPMGSPGAQGGKGDAGKPGLPGLAGSPGV 295 

lllll lllhlllllllhlll Ml: Ihllllhlll |:|| inn || 
Qy 237 IGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGA 296 

Db 296 KGDQGKPGVQGVPGPQGAPGLSGAKGEPGRTGLPGPAGPPGIAGNPGIAGVKGSKGDTGI 355 

iiiii imiiin ii i mum n n n n :mi minimi: 

Qy 297 KGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGL 356 

Db 356 QGQKGTKGESGVPGLVGRKGDTGSPGLAGPKGEPGRVGQKGDPGMKGSSGQQGQKGEKGQ 415 

III I 1 1 1 1 1 1 1 1 I II: III II: lllll MM M Hill: 
Qy 357 QGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGE 416 

Db 416 KGESFQRVRIMGGTNRGRAEVYYNNEWGTICDDDWDNNDATVFCRMLGYSRGRALSSYGG 475 

:||: 1 1 1 : 1 : : 1 1 1 i 1 1 1 1 1 : 111111111:1111 

Qy 417 RGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGA 476 

Db 476 GSGNIWLDNVNCRGTENSLWDCSKNSWGNHNCVHNEDAGVECS 518 

1:1 lllllt 1 1 1 1 1 : : M |:|lll|:|:| |:||||llll 
Qy 477 GTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECS 519 



RESULT 5 

ID US-08-934-168-3 STANDARD; PRT; 518 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 3, Application US/08934168 
XX 

CC Sequence 3, Application US/08934168 

CC GENERAL INFORMATION: 

CC APPLICANT: Bandman, Olga 

CC APPLICANT : Lai, Preeti 

CC APPLICANT: Corley, Neil C. 

CC APPLICANT: Shah, Purvi 

CC TITLE OF INVENTION: HUMAN MARCO 

CC NUMBER OF SEQUENCES: 3 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Incyte Pharmaceuticals, Inc. 

CC STREET: 3174 Porter Drive 

CC CITY: Palo Alto 

CC STATE: CA 

CC COUNTRY: USA 

CC ZIP: 94304 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Diskette 

CC COMPUTER: IBM Compatible 

CC OPERATING SYSTEM: DOS 



CC 


SOFTWARE: FastSEQ for Windows Version 2,0 


CC 


CURRENT APPLICATION DATA: 


CC 


APPLICATION NUMBER: US/08/934,168 


CC 


FILING DATE: Herewith 


CC 


CLASSIFICATION: 435 


CC 


PRIOR APPLICATION DATA: 


CC 


APPLICATION NUMBER: 


CC 


FILING DATE: 


CC 


ATTORNEY/AGENT INFORMATION: 


CC 


NAME: Billings, Lucy J. 


CC 


REGISTRATION NUMBER: 36,749 


CC 


REFERENCE/DOCKET NUMBER: PF-0392 US 


CC 


TELECOMMUNICATION INFORMATION: 


CC 


TELEPHONE: 650-855-0555 


CC 


TELEFAX: 650-845-4166 


CC 


TELEX: 


CC 


INFORMATION FOR SEQ ID NO: 3: 


CC 


SEQUENCE CHARACTERISTICS: 


CC 


LENGTH: 518 amino acids 


CC 


TYPE: amino acid 


CC 


STRANDEDNESS: single 


CC 


TOPOLOGY: linear 


CC 


IMMEDIATE SOURCE: 


CC 


LIBRARY: GenBank 


CC 


CLONE: 682723 


SQ 


SEQUENCE 518 AA; 52730 MW; 1281000 CN; 



Query Match 65,9%; Score 2414; DB 13; Length 518; 

Best Local Similarity 67,3%; Pred. No. 5.98e-186; 

Matches 352; Conservative 73; Mismatches 89; Indels 9; Gaps 8; 

Db 1 MGSKELLKEEDFLGSTEDRADFDQAMFPVMETFEINDPVPKKRNGGTFCMAVMAIHLILL 60 

I :| :IH:::|: |:: I I I : : II llll I 11:111 I :||: I llll 

Qy 1 MRNKKILKEDELLSETQQ-AAFHQ-I-A-MEPFEINVPKPKRRNGVNFSLAVWIYLILL 56 

Db 61 TAGTALLLIQVLNLQEQLQMLEMCCGNGSLAIEDKPFFSL-QWA-PKTHLVPRAQGLQAL 118 

iimmmiiii mmi i m n i in i i i n mm 

Qy 57 TAGAGLLWQVLNLQARLRVLEMYFLNDTLAAEDSPSFSLLQSAHPGEHLAQGASRLQVL 116 

Db 119 QAQLSWVHTSQEQLRQQFNNLTQNPELFQI KGERGSPG * P • ' KGAPGAPG I PGLPGPAAE 175 

M 1 1 : 1 1 : |:M h Mllll :|:||||:|:|| III I II II MM 
Qy 117 QAQLTWVRVSHEHLLQRVDNFTQNPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAE 176 

Db 176 KGEKGAAGRDGTPGVQGPQGPPGSKGEAGLQGLTGAPGKQGATGAPGPRGEKGSKGDIGL 235 

II III llll: I lllhll M 1 1 1 1 1 1 lllll : I M : S 1 1 M 1 1 1 || 

Qy 177 KGAKGAMGRDGATGPSGPQGPPGVKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGL 236 

Db 236 TGPKGEHGTKGDKGDLGLPGNKGDMGMKGDTGPMGSPGAQGGKGDAGKPGLPGLAGSPGV 295 

lllll M M II :ll MIM Ihllllhlll 1:11 lllll II 
Qy 237 IGPKGETGTKGEKGDLGLPGSKGDRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGA 296 

Db 296 KGDQGKPGVQGVPGPQGAPGLSGAKGEPGRTGLPGPAGPPGIAGNPGIAGVKGSKGDTGI 355 

lllll MM II || I : 1 1 1 1 1 1 ! :| || II l| :|:|| MMIIIMIM 
Qy 297 KGDQGQPGLQGVPGPPGAVGHPGAKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTGL 356 

Db 356 QGQKGTKGESGVPGLVGRKGDTGSPGLAGPKGEPGRVGQKGDPGMKGSSGQQGQKGEKGQ 415 

III I MM | II; UN lh lllll Ml Ml lllll: 

Qy 357 QGQQGRKGESGVPGPAGVKGEQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGE 416 

Db 416 KGESFQRVRIMGGTNRGRAEVYYNNEWGTICDDDWDNNDATVFCRMLGYSRGRALSSYGG 475 

MM llhhmilllllh llllllhhhll mimimiii h 

Qy 417 RGENSVSVRIVGSSNRGRAEVYYSGTWGTICDDEWQNSDAIVFCRMLGYSKGRALYKVGA 476 

Db 476 GSGNIWLDNVNCRGTENSLWDCSKNSWGNHNCVHNEDAGVECS 518 

hi llllll M I M : : 1 1 I : I ! 1 1 1 : 1 : 1 hllllllll 
Qy 477 GTGQIWLDNVQCRGTESTLWSCTKNSWGHHDCSHEEDAGVECS 519 



RESULT 6 

ID US-08-794-795-7 STANDARD; PRT; 489 AA. 
XX 

AC XXXXXX 
XX 
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DT 
XX 

DE Sequence 7, Application US/08794795 
XX 

CC Sequence 7, Application US/08794795 

CC GENERAL INFORMATION: 

CC APPLICANT: Elshourlagy, Nabil 

CC APPLICANT: Adamou, John 

CC APPLICANT: Gross, Mitchell 

CC APPLICANT: Lysko, Paul 

CC TITLE OF INVENTION: Human Macro Scavenger Rec 

CC TITLE OF INVENTION: eptor 

CC NUMBER OF SEQUENCES: 9 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: SmithKline Beecham Corporation 

CC STREET: 709 Svedeland Road 

CC CITY: King of Prussia 

CC STATE: PA 

CC COUNTRY: USA 

CC ZIP: 19406 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Diskette 

CC COMPUTER: IBM Compatible 

CC OPERATING SYSTEM: DOS 

CC SOFTWARE: FastSEQ for Windows Version 2.0 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/794,795 

CC FILING DATE: 04-FEB-1997 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER; ATG50009P 

CC FILING DATE: 22-MAY-1996 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Han, William T 

CC REGISTRATION NUMBER: 34,344 

CC REFERENCE/DOCKET NUMBER: ATG50009 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 610*270-5219 

CC TELEFAX: 610-270-4026 

CC TELEX: 

CC INFORMATION FOR SEQ ID NO: 7: 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH: 489 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS; Single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 489 AA; 49441 MW; 1141296 CN; 

Query Match 64,6%; Score 2369; DB 11; Length 489; 
Best Local Similarity 69,2%; Pred, No. 3.32e-182; 



Matches 


342; Conservative 63; Mismatches 84; Indels 5; Gaps 


Db 


l 


METFEINDPVPKKRNGGTFCMAVMAIHLILLTAGTALLLIQVLNLQEQLQMLEMCCGNGS 


60 






II Mil 1 Ihlll 1 1 1 II::: ::M 1 :|::||| | : 




Qy 


26 


MEPFEINVPKPKRRNGVNFSLAWVIYLILLTAGAGLLWQVLNLQARLRVLEMYFLNDT 


85 


Db 


61 


LAIEDKPFFSL-QWA-PKTHLVPRAQGLQALQAQLSWVHTSQEQLRQQFNNLTQNPELFQ 


118 






II II 1 III 1 1 1 II 1 II 11111:11: |:|:| 1: :|:|||| :|: 




Qy 


86 


LAAEDSPSFSLLQSAHPGEHLAQGASRLQVLQAQLTWVRVSHEHLLQRVDNFTQNPGMFR 


145 


Db 


119 


I KGERGSPG - P - - KGAPGAPG IPGLPGPAAEKGEKGAAGRDGTPGVQGPQGPPGSKGEAG 


175 






llli:|:|| III 1 II II llhllll III Mil: 1 lllllll lllll 




Qy 


146 


IKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAG 


205 


Db 


176 


LQGLTGAPGKQGATGAPGPRGEKGSKGDIGLTGPKGEHGTKGDKGDLGLPGNKGDMGMKG 


235 






III lllllllllhllhllllllll II lllll IIMIUIMM 1 Mil 




Qy 


206 


LQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKG 


265 


Db 


236 


DTGPMGSPGAQGGKGDAGKPGLPGLAGSPGVKGDQGKPGVQGVPGPQGAPGLSGAKGEPG 


295 






1:1 11:11111:111 hll lllll II Mill Ihllllll 11 1 Mllllll 




Qy 


266 


DAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPG 


325 



Db 296 RTGLPGPAGPPGIAGNPGIAGVKGSKGDTGIQGQKGTKGESGVPGLVGRKGDTGSPGLAG 355 

:| II II II : : : 1 : 1 1 1 1 1 1 1 1 : 1 1 1 I Mllllll I II: 1 1 1 1 M I 
Qy 326 SAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAG 385 

Db 356 PKGEPGRVGQKGDPGMKGSSGQQGQKGEKGQKGESFQRVRIMGGTNRGRAEVYYNNEWGT 415 

III II: Mill MUM M. :: M 1 1| : I : : 1 1 1 1 1 II II : III 
Qy 386 PKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRGRAEVYYSGTWGT 445 

Db 416 ICDDDWDNNDATVFCRMLGYSRGRALSSYGGGSGNIWLDNVNCRGTENSLWDCSKNSWGN 475 

M MM M F I E 1 L 1 1 : 1 1 ! I MIMI MllhMI MINI: 

Qy 446 ICDDEWQNSDAIVFCRMLGYSKGRALYKVGAGTGQIWLDNVQCRGTESTLWSCTKNSWGH 505 

Db 476 HNCVHNEDAGVECS 489 

|:| MMIMMI 
Qy 506 HDCSHEEDAGVECS 519 



RESULT 7 

ID US-08-478-029A-66 STANDARD; PRT; 357 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 66, Application US/08478029A 
XX 

CC Sequence 66, Application US/08478029A 

CC GENERAL INFORMATION: 

CC APPLICANT: Ferrari, Franco A. 

CC APPLICANT: Cappello, Joseph 

CC TITLE OF INVENTION: Functional Recombinant^ Prepared 

CC TITLE OF INVENTION; Synthetic Protein Polymer 

CC NUMBER OF SEQUENCES: 119 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE; Flehr, Hohbach, Test, Albritton & Herbert 

CC STREET: Four Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: CA 

CC COUNTRY: US 

CC ZIP: 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE; Patentln Release 11. 0, Version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/478, 029A 

CC FILING DATE: 07-JUN-1995 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER; US 07/609,716 

CC FILING DATE: 06-NOV-1990 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 07/269,429 

CC FILING DATE; 09-NOV-1988 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: US 07/114,618 

CC FILING DATE: 29-OCM987 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 06/927,258 

CC FILING DATE; 04-NOV-1986 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Trecartin, Richard F. 

CC REGISTRATION NUMBER; 31,801 

CC REFERENCE/DOCKET NUMBER: A-55186-8/RFT/MTK 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE: 415-781-1989 

CC TELEFAX; 415-398-3249 

CC INFORMATION FOR SEQ ID NO: 66: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 357 amino acids 

CC TYPE: amino acid 

CC STRANDEDNESS: single 
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CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 357 AA; 31954 MW; 670322 CN; 

Query Match 21.74; Score 796; DB 8; Length 357; 

Best Local Similarity 48.4%; Pred. No. 2.12e-52; 

Matches 134; Conservative 45; Mismatches 95; Indels 3; Gaps 2; 

Db 40 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 99 

I 1:1! I I Ihlllll h II :l I II h:(l II I 

0y 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 100 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 159 

II 1:11 II I HI I I: I I I :|::i II I I :|: I I I : 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 160 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 219 

I MM: I: I I Mill :|:ll:lll:|: I I :| llhhll III I I 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQ-P-GLQGVPGPPGAVGHPGAKGEP 324 

Db 220 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 279 

|::|:|l :| 1 1 : 1 1 : 1 1 : I I I I :|: I II I lllll I hll : 
Qy 325 GSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLA 384 

Db 280 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAG 316 

II hll I I I::| I: I Ihlh : 

Qy 385 GPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENS 421 



RESULT 8 

ID US-08-475-411A-66 STANDARD; PRT; 357 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 66, Application US/08475411A 
XX 

CC Sequence 66, Application US/08475411A 

CC GENERAL INFORMATION: 

CC APPLICANT: Ferrari, Franco A. 

CC APPLICANT: Cappello, Joseph 

CC TITLE OF INVENTION: Functional Recombinantly Prepared 

CC TITLE OF INVENTION: Synthetic Protein Polymer 

CC NUMBER OF SEQUENCES: 119 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

CC STREET: Four Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: CA 

CC COUNTRY: US 

CC ZIP: 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC -DOS/MS -DOS 

CC SOFTWARE: Patentln Release 11. 0, Version #1.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/475, 411A 

CC FILING DATE: 07-JUN-1995 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 07/609,716 

CC FILING DATE: 06-NOV-1990 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 07/269,429 

CC FILING DATE: 09*NOV-1988 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 07/114,618 

CC FILING DATE: 29-OCM987 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 06/927,258 

CC FILING DATE: 04-NOV-1986 



CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Trecartin, Richard F. 

CC REGISTRATION NUMBER: 31,801 

CC REFERENCE/DOCKET NUMBER: A-55186-9/RFT/MTK 

CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 415-781-1989 

CC TELEFAX: 415-398-3249 

CC INFORMATION FOR SEQ ID NO: 66: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 357 amino acids 

CC TYPE; amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 357 AA; 31954 MW; 670322 CN; 

Query Match 21.7%; Score 796; DB 8; Length 357; 

Best Local Similarity 48.4%; Pred. No. 2.12e-52; 

Matches 134; Conservative 45; Mismatches 95; Indels 3; Gaps 2; 

Db 40 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 99 

I |:|| I I: I 11:11111 h II :| I II l::ll III I :l 

Qy 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 100 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 159 

II hll II I III I I: I I I :h:l II I I :|: I I I : 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 160 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 219 

I Mil: I: I I lllll : 1 : 1 1 : 1 1 1 : 1 1 | | :| ||h|:|| Ml I I 

Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQ-P-GLQGVPGPPGAVGHPGAKGEP 324 

Db 220 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAGPKGADGSPGPAGPVGSPGAPGPP 279 

|::|:|| :| ||:||:||: I I I I :|: I II I lllll I Ml 
Oy 325 GSAGSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLA 384 

Db 280 GPPGPPGPPGAPGPPGPPGPPGPPGLPGPKGDRGDAG 316 

II hll :| ! I h:l h I Ihlh : 

Qy 385 GPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENS 421 



RESULT 9 

ID US-08-475-411A-65 STANDARD; PRT; 408 AA, 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 65, Application US/08475411A 
XX 

CC Sequence 65, Application US/084754UA 

CC GENERAL INFORMATION: 

CC APPLICANT: Ferrari, Franco A, 

CC APPLICANT: Cappello, Joseph 

CC TITLE OF INVENTION: Functional Recombinantly Prepared 

CC TITLE OF INVENTION: Synthetic Protein Polymer 

CC NUMBER OF SEQUENCES: 119 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

CC STREET: Four Embarcadero Center, Suite 3400 

CC CITY: San Francisco 

CC STATE: CA 

CC COUNTRY: US 

CC ZIP: 94111 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentln Release #1.0, Version 11.30 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/475, 411A 

CC FILING DATE: 07-JUN-1995 

CC CLASSIFICATION: 435 



Pri Sep 11 07:51:45 1998 



US-08-794- 



795-6. rap 



Page 



CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: US 07/609,716 

CC FILING DATE; 06-NOV-1990 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER; US 07/269,429 

CC FILING DATE; 09-NOV-1988 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER; US 07/114,618 

CC FILING DATE: 29-OCM987 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER: US 06/927,258 

CC FILING DATE: 04-NOV-1986 

CC ATTORNEY/AGENT INFORMATION: 

CC NAME: Trecartin, Richard P. 

CC REGISTRATION NUMBER: 31,801 

CC REFERENCE/DOCKET NUMBER: A- 55186 "9/RFT/MTK 

CC TELECOMMUNICATION INFORMATION; 

CC TELEPHONE: 415*781*1989 

CC TELEFAX; 415-398*3249 

CC INFORMATION FOR SEQ ID NO: 65: 

CC SEQUENCE CHARACTERISTICS; 

CC LENGTH; 408 amino acids 

CC TYPE; amino acid 

CC STRANDEDNESS: single 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 408 AA; 35237 MW; 959886 CN; 

Query Match 20,7%; Score 757; DB 8; Length 408; 

Best Local Similarity 47.61; Pred. No. 3,07e-49; 

Matches 129; Conservative 38; Mismatches 104; Indels 0; Gaps 0; 

Db 55 GPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAP 114 

I 1:11 I !::! lillllllll: II I h MM ihl I 

Qv 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 115 GPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPP 174 

II Ml I: hill I I: I I II I I: I I I ||: I I I : 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 175 GPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPP 234 

I Mil: I: I I Mill :| lh Mill Mill: I I II:: 
Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSA 327 

Db 235 GPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPP 294 

hll :l Ihlhlh I h I :| I I I :| MM I Ml 
Qy 328 GSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPK 387 

Db 295 GPPGPAGPVGSPGAPGPPGPPGPPGPPGAPG 325 

hll II I I h:l I I I I 
Qy 388 GAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 10 

ID US*08*478*029A-65 STANDARD; PRT; 408 AA. 
XX 

AC XXXXXX 

XX 

DT 

XX 

DE Sequence 65, Application US/08478029A 
XX 

CC Sequence 65, Application US/08478029A 

CC GENERAL INFORMATION: 

CC APPLICANT: Ferrari, Franco A. 

CC APPLICANT; Cappello, Joseph 

CC TITLE OF INVENTION: Functional Recombinantly Prepared 

CC TITLE OF INVENTION; Synthetic Protein Polymer 

CC NUMBER OF SEQUENCES; 119 

CC CORRESPONDENCE ADDRESS: 

CC ADDRESSEE: Flehr, Hohbach, Test, Albritton & Herbert 

CC STREET: Four Embarcadero Center, Suite 3400 

CC CITY: San Francisco 



CC 


STATE: CA 


CC 


COUNTRY: US 


CC 


ZIP: 94111 


CC 


COMPUTER READABLE FORM: 


CC 


MEDIUM TYPE; Floppy disk 


CC 


COMPUTER; IBM PC compatible 


CC 


OPERATING SYSTEM: PC-DOS/MS-DOS 


CC 


SOFTWARE: Patentln Release #1.0, Version #1.30 


CC 


CURRENT APPLICATION DATA: 


CC 


APPLICATION NUMBER: US/08/478, 029A 


CC 


FILING DATE; 07-JUN-1995 


CC 


CLASSIFICATION: 435 


CC 


PRIOR APPLICATION DATA: 


CC 


APPLICATION NUMBER: US 07/609,716 


CC 


FILING DATE; 06-NOV-1990 


CC 


PRIOR APPLICATION DATA; 


CC 


APPLICATION NUMBER; US 07/269,429 


CC 


FILING DATE: 09-NOM988 


CC 


PRIOR APPLICATION DATA; 


CC 


APPLICATION NUMBER: US 07/114,618 


CC 


FILING DATE: 29-OCM987 


CC 


PRIOR APPLICATION DATA: 


CC 


APPLICATION NUMBER: US 06/927,258 


CC 


FILING DATE: 04-NOV-1986 


CC 


ATTORNEY/AGENT INFORMATION: 


CC 


NAME: Trecartin, Richard F. 


CC 


REGISTRATION NUMBER: 31,801 


CC 


REFERENCE/DOCKET NUMBER; A-55186*8/RFT/MTK 


CC 


TELECOMMUNICATION INFORMATION: 


CC 


TELEPHONE; 415-781-1989 


CC 


TELEFAX: 415-398-3249 


CC 


INFORMATION FOR SEQ ID NO: 65: 


CC 


SEQUENCE CHARACTERISTICS: 


CC 


LENGTH: 408 amino acids 


CC 


TYPE: amino acid 


CC 


STRANDEDNESS: single 


CC 


TOPOLOGY: linear 


CC 


MOLECULE TYPE: protein 


SQ 


SEQUENCE 408 AA; 35237 MW; 959886 CN; 



Query Match 20.7%; Score 757; DB 8; Length 408; 

Best Local Similarity 47.6%; Pred. No. 3,07e-49; 

Matches 129; Conservative 38; Mismatches 104; Indels 0; Gaps 0; 

Db 55 GPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAP 114 

I hll I h:l lllllllllh h II I h Ihll Ihl I 

Qy 148 GEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPGVKGEAGLQ 207 

Db 115 GPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPP 174 

II hll h hill I h I I II I h I I I lh I I I = 

Qy 208 GPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKGDRGMKGDA 267 

Db 175 GPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPP 234 

! Illh h I I Mil lh I I II I Mil: I |: I lh: 
Qy 268 GVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPGAKGEPGSA 327 

Db 235 GPPGPPGPPGAPGPPGPPGPPGPPGPAGPVGSPGAPGPPGPPGPPGPPGAPGPPGPPGPP 294 

hll :| Ihlhlh I N:l I I hi llhl I hll M 
Qy 328 GSPGRAGLPGSPGSPGATGLKGSKGDTGLQGQQGRKGESGVPGPAGVKGEQGSPGLAGPK 387 

Db 295 GPPGPAGPVGSPGAPGPPGPPGPPGPPGAPG 325 

hll II I I h:| I I I I 
Qy 388 GAPGQAGQKGDQGVKGSSGEQGVKGEKGERG 418 



RESULT 11 

ID US-08-763-497-8 STANDARD; PRT; 1107 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 8, Application US/08763497 



Fri Sep 11 07:51:45 1998 



US-08-794-795-6.rap 



Page 



xx 

CC Sequence 8, Application US/08763497 

CC GENERAL INFORMATION: 

CC APPLICANT; Gruskin et al. 

CC TITLE OF INVENTION; Recombinant Chimeric Proteins And 

CC TITLE OF INVENTION; Methods Of Use Thereof 

CC NUMBER OF SEQUENCES: 8 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE: Patentm Release #1.0, version 11,30 (EPO) 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER: US/08/763,497 

CC FILING DATE; 11-DEM996 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER; 08/259,263 

CC FILING DATE: 

CC INFORMATION FOR SEQ ID NO: 8: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1107 amino acids 

CC TYPE: amino acid 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 1107 AA; 100327 MW; 5735272 CN; 

Query Match 20.7%; Score 757; DB 11; Length 1107; 

Best Local Similarity 45.1%; Pred. No. 3,Q7e-49; 

Matches 134; Conservative 47; Mismatches 113; Indels 3; Gaps 2; 

Db 289 EPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAG 348 

:|l : I I::| :| :|| I II: I h I Ml :M Ihl 

Qy 140 NPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPG 199 

Db 349 ERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPG 408 

:| :l II hll I I : I I 1 1 I I 1 1 I 1 1 I I : I 1 1 : I 
Qy 200 VKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKG 259 

Db 409 PPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAG 468 

I :l:lllll II: I: h l::l |::l III I I I I lllh I 
Qy 260 DRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPG 319 

Db 469 ERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQG 528 

:|| hlllll llll::|:|l :| I I II I I :|| I II II I 
Qy 320 AKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTG-L--QGQQGRKGESGVPGPAGVKG 376 

Db 529 PPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRG 585 

I::| |::IMI I III I h:| II I I II : : ::M 
Qy 377 EQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRG 433 



RESULT 12 

ID US-08-763-497-5 STANDARD; PRT; 1169 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 5, Application US/08763497 
XX 

CC Sequence 5, Application US/08763497 

CC GENERAL INFORMATION: 

CC APPLICANT: Gruskin et al. 

CC TITLE OF INVENTION: Recombinant Chimeric Proteins And 

CC TITLE OF INVENTION; Methods Of Use Thereof 

CC NUMBER OF SEQUENCES: 8 

CC COMPUTER READABLE FORM; 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC-DOS/MS-DOS 

CC SOFTWARE; Patentln Release #1.0, Version #1.30 (EPO) 

CC CURRENT APPLICATION DATA; 



CC APPLICATION NUMBER; US/08/763,497 

CC FILING DATE: ll-DEC-1996 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA: 

CC APPLICATION NUMBER; 08/259,263 

CC FILING DATE; 

CC INFORMATION FOR SEQ ID NO; 5: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1169 amino acids 

CC TYPE: amino acid 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 1169 AA; 107458 MW; 6480635 CN; 

Query Match 20.7%; Score 757; DB 11; Length 1169; 

Best Local Similarity 45.1%; Pred. No. 3.07e-49; 

Matches 134; Conservative 47; Mismatches 113; Indels 3; Gaps 2 

Db 289 EPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAG 348 

:|| : I |::| :| :|l I lh I 1 1 1 1 : 1 : 1 : I: I II I -I 
Qy 140 NPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPG 199 

Db 349 ERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPG 408 

:| :| II 1:11 I I :| I II I I II I II I I: I lh I 
Qy 200 VKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKG 259 

Db 409 PPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAG 468 

I : 1 : 1 1 ' I II: I: |: |::| |::| III I I I I lllh I 
Qy 260 DRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPG 319 

Db 469 ERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQG 528 

:|| hlllll lllh:hll :| I I I I I I HI I II II I 
Qy 320 AKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTG-L-QGQQGRKGESGVPGPAGVKG 376 

Db 529 PPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRG 585 

|::| |::MII I III I hH II I I II = = 
Qv 377 EQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRG 433 



RESULT 13 

ID US-08-763-497-6 STANDARD; PRT; 1171 AA. 
XX 

AC xxxxxx 

XX 

DT 

XX 

DE Sequence 6, Application US/08763497 
XX 

CC Sequence 6, Application US/08763497 

CC GENERAL INFORMATION: 

CC APPLICANT: Gruskin et al. 

CC TITLE OF INVENTION: Recombinant Chimeric Proteins And 

CC TITLE OF INVENTION; Methods Of Use Thereof 

CC NUMBER OF SEQUENCES: 8 

CC COMPUTER READABLE FORM: 

CC MEDIUM TYPE: Floppy disk 

CC COMPUTER: IBM PC compatible 

CC OPERATING SYSTEM: PC -DOS/MS -DOS 

CC SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

CC CURRENT APPLICATION DATA: 

CC APPLICATION NUMBER; US/08/763,497 

CC FILING DATE: ll-DEC-1996 

CC CLASSIFICATION: 435 

CC PRIOR APPLICATION DATA; 

CC APPLICATION NUMBER: 08/259,263 

CC FILING DATE: 

CC INFORMATION FOR SEQ ID NO: 6: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1171 amino acids 

CC TYPE: amino acid 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

SQ SEQUENCE 1171 AA; 107798 MW; 6537396 CN; 
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Query Match 20.7%; Score 757; DB 11; Length 1171; 

Best Local Similarity 45,1%; Pred, No. 3,07e-49; 

Matches 134; Conservative 47; Mismatches 113; Indels 3; Gaps 2; 

Db 289 EPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAG 348 

:M : I :| :l! I I I 1 1 1 1 : 1 : 1 : h I III :|| ll:| 
Qy 140 NPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPG 199 

Db 349 ERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPG 408 

I :| II I:|| I I :M II I I II I II I I: I il: I 
Qy 200 VKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKG 259 

Db 409 PPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAG 468 

I II: I: h |::| h:l III I I II lllh I 

Qy 260 DRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPG 319 

Db 469 ERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQG 528 

:|| hlllll llll-hll :| i I I I I I :|| I II II I 
Qy 320 AKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTG-L-QGQQGRKGESGVPGPAGVKG 376 

Db 529 PPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRG 585 

|::| |::IMI I III I |::| II I I II : : ::|| 
Qy 377 EQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRG 433 



xxxxxx 



RESULT 14 

ID US-08-187-319A-18 
XX 
AC 
XX 
DT 
XX 
DE 
XX 

cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 



STANDARD; PRT; 1341 AA, 



Sequence 18, Application OS/08187319A 

Sequence 18, Application US/08187319A 
GENERAL INFORMATION; 
APPLICANT: Ovist, Per 
APPLICANT: Bonde, Martin 

TITLE OF INVENTION: A Method for Assaying Collagen Fragments 
TITLE OF INVENTION: in Body Fluids, A Test Kit and Means for Carrying Out the 
TITLE OF INVENTION: Method and Use of the Method to Diagnose the Presence of 
TITLE OF INVENTION: Disorders Associated with the Metabolism of 
NUMBER OF SEQUENCES: 21 
CORRESPONDENCE ADDRESS: 
ADDRESSEE: Darby & Darby PC 
STREET: 805 Third Avenue 
CITY: New York 
STATE: New York 
COUNTRY; USA 
ZIP: 10022 
COMPUTER READABLE FORM: 
MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release tl.0, Version 11.25 
CURRENT APPLICATION DATA: 
APPLICATION NUMBER: US/08/187 , 319A 
FILING DATE: 21-JAN-1994 
CLASSIFICATION: 436 
ATTORNEY/AGENT INFORMATION: 
NAME: Gogoris, Adda C 
REGISTRATION NUMBER: 29,714 
REFERENCE/DOCKET NUMBER: 4305/08701 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 212-527-7700 
TELEFAX: 212-753-6237 
TELEX: 236687 
INFORMATION FOR SEQ ID NO: 18: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 1341 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 



CC MOLECULE TYPE: protein 

CC ORIGINAL SOURCE: 

CC ORGANISM: Homo sapiens 

CC IMMEDIATE SOURCE: 

CC CLONE: COLLAGEN ALPHA 1 (I) 

SQ SEQUENCE 1341 AA; 128257 MW; 8816128 CN; 



Query Match 20.7%; 
Best Local Similarity 45.1%; 
Matches 134; Conservative 



Score 757; DB 5; Length 1341; 
Pred. No. 3.07e-49; 

47; Mismatches 113; Indels 3; Gaps 2; 



Db 326 EPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAG 385 

I : | |::| :| :|| | ||: | ||||:|:|: |: I III :|| IN 
Qy 140 NPGMF R I KG EQG APG LQG HKG AMGMPG APG PPG PPAEKG AKG AMGRDG ATG PSG PQG P PG 199 

Db 386 ERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPG 445 

:| :| II I I :| I II I I II I II I h I 11= I 
Qy 200 VKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKG 259 

Db 446 PPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAG 505 

I ihlllll lh I: I: l::l l-l III I M I lllh I 
Qy 260 DRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPG 319 

Db 506 ERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQG 565 

:M hlllll lllh:hll I I I I I :|| I II II I 

Qy 320 AKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTG-L--QGQQGRKGESGVPGPAGVKG 376 

Db 566 PPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRG 622 

h: h:|||| | III | |::| II I I II : : ::|l 
Qy 377 EQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRG 433 



xxxxxx 



Sequence 18, Application US/08963825 

Sequence 18, Application US/08963825 
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CC TELECOMMUNICATION INFORMATION: 

CC TELEPHONE: 212-527-7700 

CC TELEFAX: 212*753-6237 

CC TELEX: 236687 

CC INFORMATION FOR SEQ ID NO: 18: 

CC SEQUENCE CHARACTERISTICS: 

CC LENGTH: 1341 amino acids 

CC TYPE: amino acid 

CC TOPOLOGY: linear 

CC MOLECULE TYPE: protein 

CC ORIGINAL SOURCE: 

CC ORGANISM: Homo sapiens 

CC IMMEDIATE SOURCE: 

CC CLONE: COLLAGEN ALPHA 1 (I) 

SQ SEQUENCE 1341 AA; 128257 MW; 8816128 CN; 

Query Match 20.7%; Score 757; DB 13; Length 1341; 

Best Local Similarity 45. H; Pred. No. 3,07e-49; 

Matches 134; Conservative 47; Mismatches 113; Indels 3; Gaps 2; 

Db 326 EPGPVGVQGPPGPAGEEGKRGARGEPGPTGLPGPPGERGGPGSRGFPGADGVAGPKGPAG 385 

:M : I h:l :l Ml I I I :|:|; |: | || | :|| ||:| 
Qy 140 NPGMFRIKGEQGAPGLQGHKGAMGMPGAPGPPGPPAEKGAKGAMGRDGATGPSGPQGPPG 199 

Db 386 ERGSPGPAGPKGSPGEAGRPGEAGLPGAKGLTGSPGSPGPDGKTGPPGPAGQDGRPGPPG 445 

:| :l II hll I I : I I 1 1 I I 1 1 I 1 1 I I : I 1 1 : I 
Qy 200 VKGEAGLQGPQGAPGKQGATGTPGPQGEKGSKGDGGLIGPKGETGTKGEKGDLGLPGSKG 259 

Db 446 PPGARGQAGVMGFPGPKGAAGEPGKAGERGVPGPPGAVGPAGKDGEAGAQGPPGPAGPAG 505 

I II: h h |::| MM III I I I I MM: | 

Qy 260 DRGMKGDAGVMGPPGAQGSKGDFGRPGPPGLAGFPGAKGDQGQPGLQGVPGPPGAVGHPG 319 

Db 506 ERGEQGPAGSPGFQGLPGPAGPPGEAGKPGEQGVPGDLGAPGPSGARGERGFPGERGVQG 565 

Ml Ml ||||::|:M I I I I Ml I II II I 

Qy 320 AKGEPGSAGSPGRAGLPGSPGSPGATGLKGSKGDTG-L-QGQQGRKGESGVPGPAGVKG 376 

Db 566 PPGPAGPRGANGAPGNDGAKGDAGAPGAPGSQGAPGLQGMPGERGAAGLPGPKGDRG 622 

MM I : : 1 1 1 1 I III I MM II I I II : : 
Qy 377 EQGSPGLAGPKGAPGQAGQKGDQGVKGSSGEQGVKGEKGERGENSVSVRIVGSSNRG 433 
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